PRELIMINARY AMENDMENT

Assistant Commissioner for Patents 
United States Patent and Trademark Office
Washington, D.C. 20231 

Sir: 

Prior to the examination on the merits, please amend the above-captioned patent 
application as follows: 
IN THE SPECIFICATION:

On page 1, line 4, after the Title of the Invention please insert --This invention is a
continuation of PCT/GB98/01531 filed May 27, 1998--.

IN THE CLAIMS:

Please amend claims 4-13, 15-17, 19-23, 26, 28-29, 32-35, 37-41, 43-47.

Claim 4 (Amended). The method of [any one of claims 1-3] claim 1 wherein the 
breed determinant is an overt phenotypic trait that is a behavioural or morphological trait. 

Claim 5 (Amended). The method of [claim 3 or] claim 4 wherein the overt
phenotypic trait varies qualitatively or quantitatively between breeds.

Claim 6 (Amended). The method of [any one of the preceding claims] claim 1
wherein the breed determinant gene analysed in step (ii) is selected from [any of] the group
consisting of:

(a) a coat [colour] color gene; [and/or] 

(b) a coat pattern gene; [and/or] 

(c) a coat texture gene; [and/or] 

(d) a coat density gene; [and/or] 

(e) a coat length gene; [and/or] 

(f) an ear aspect gene; [and/or] 

(g) a double muscling gene; [and/or] 

(h) a horn morphology gene; [and/or] 

(i) a tusk morphology gene; [and/or]

(j) an eye [colour] color gene; [and/or]

(k) a plumage gene; [and/or]

(l) a beak [colour] color/morphology gene; [and/or]

(m) a vocalization (e.g. barking) gene; [and/or]

(n) a comb or wattle gene; [and/or]

(o) a gene controlling display behaviour[.]; and

(p) combinations thereof.

Claim 7 (Amended). The method of claim 6 [(a)] wherein the coat [colour] color
gene is the KIT or aMSHR gene [(for example, the pig KIT or aMSHR gene)].

Claim 8 (Amended). The method [of any one of the preceding claims] of claim 1
wherein the sample is a nucleic acid sample and the analysing step (ii) comprises DNA or RNA
analysis.

Claim 9 (Amended). The method of [any one of claims 1-7] claim 1 wherein the 
sample is a protein sample and the analysing step (ii) comprises protein analysis. 

Claim 10 (Amended). A method of determining the coat [colour] color genotype 
of a pig which comprises: 

(i) obtaining a sample of pig nucleic acid; and 

(ii) analysing the nucleic acid obtained in (i) to determine which [allele or] alleles 
of the [aMSHR] aMSHR gene [is/] are present. 

Claim 11 (Amended). The method of claim 8 [or claim 10] wherein the analysis
step (ii) [comprises] is selected from the group consisting of:

(a) selectively amplifying a specific fragment of nucleic acid [(e.g. by PCR);
and/or];

(b) testing for the presence of one or more restriction endonuclease sites within
the breed determinant gene(s) [/aMSHR gene (e.g., restriction fragment length polymorphism
(RFLP) analysis); and/or];

(c) determining the nucleotide sequence of all or a portion of the breed
determinant gene(s) [/aMSHR gene; and/or];

(d) probing the nucleic acid sample with an allele-specific DNA or RNA probe;
[and/or];

(e) carrying out one or more PCR amplification cycles of the nucleic acid sample
using at least one pair of suitable primers and then carrying out RFLP analysis on the amplified
nucleic acid so obtained; and

(f) combinations thereof.

Claim 12, line 4, delete "colour" and insert --color--;

line 5, after "protein" insert

Claim 13 (Amended). The method of claim [9 or claim] 12 wherein the analysis
step (ii) [comprises] is selected from the group consisting of:

(a) probing the protein sample with an antibody [(e.g. a monoclonal antibody)] 
specific for an allele-specific epitope; [and/or] 

(b) electrophoretic analysis; [and/or] 

(c) chromatographic analysis; [and/or] 

(d) amino-acid sequence analysis; [and/or] 

(e) proteolytic cleavage analysis; [and/or] 

(f) epitope mapping; [and/or] 

(g) translating a copy of the DNA or RNA of the gene produced by PCR or other
means in an in-vitro transcription/translation system; and

(h) combinations thereof.

Claim 15, line 1, delete "or claim 14". 
Claim 16, line 1, delete "or claim 11". 

Claim 17 (Amended). The method of [any one of claims] claim 7[, 10, 11, 14 and
15] wherein the analysis step (ii) further comprises determining the association between one or
more microsatellite or other linked marker alleles linked to the KIT or aMSHR gene and to
particular alleles of the KIT or aMSHR gene.

Claim 19, line 1, delete "7" and insert --17--; and

line 5, delete "allele or".

Claim 20, line 1, delete "colour" and insert --color--; and

line 7, delete "allele or".

Claim 21, line 1, delete "any one of claims 7, 10, 11 and 14-20" and insert --claim 7--.

Claim 22, line 1, delete "colour" and insert --color--.

Claim 23 (Amended). The method of claim 22 wherein the additional coat
[colour] color locus is the KIT gene locus [(e.g. the pig KIT gene locus)].

Claim 26, line 1, delete "colour" and insert --color--.

Claim 28 (Amended). A method as claimed in claim 26 [or claim 27 wherein a
sample of pig] wherein said pig nucleic acid is genomic DNA [is amplified] and further
comprising the step of amplifying said genomic DNA using PCR and a pair of suitable primers.

Claim 29 (Amended). The method of claim 21 wherein the additional locus is a 
breed determinant gene locus [selected from any of those genes specified in claim 6] selected 
from the group consisting of: 

(a) a coat color gene;

(b) a coat pattern gene;

(c) a coat texture gene;

(d) a coat density gene;

(e) a coat length gene;

(f) an ear aspect gene;

(g) a double muscling gene;

(h) a horn morphology gene;

(i) a tusk morphology gene;

(j) an eye color gene;

(k) a plumage gene;

(l) a beak color/morphology gene;

(m) a vocalization (e.g. barking) gene;

(n) a comb or wattle gene;

(o) a gene controlling display behaviour; and

(p) combinations thereof.

Claim 32 (Amended). The method of [any one of claims] claim 7[, 10, 11, 14-23
and 28-31] wherein the analysis step (ii) comprises PCR using at least one pair of suitable
primers.

Claim 33 (Amended). The method of claim 32 wherein the gene is the pig
aMSHR gene and at least one pair of suitable primers is selected from the group consisting of:

[aMSHR] aMSHR Forward Primer 1: (5'-TGT AAA ACG ACG GCC AGT RGT GCC TGG
AGG TGT-3');

[aMSHR] aMSHR Reverse Primer 5: (5'-CGC CCA GAT GGC CGC GAT GGA CCG-3'); [or]

[aMSHR] aMSHR Forward Primer 2: (5'-CGG CCA TCT GGG CGG GCA GCG TGC-3')
[aMSHR] aMSHR Reverse Primer 2: (5'-GGA AGG CGT AGA TGA GGG GGT CCA-3'); [or]
and

[aMSHR] aMSHR Forward Primer 3: (5'-GCA CAT CGC CCG GCT CCA CAA GAC-3')
[aMSHR] aMSHR Reverse Primer 3: (5'-GGG GCA GAG GAC GAC GAG GGA GAG-3').

Claim 34 (Amended). The method of [any one of claims 7, 10, 11, 14-23 and 28-
33] claim 10 wherein the analysis step (ii) comprises restriction fragment length polymorphism
(RFLP) analysis [for example involving digesting the pig nucleic acid with one or more of the
restriction enzymes BstUI, HhaI and/or BspHI].

Claim 35, line 4, delete "1010of" and insert --1010 of--.

Claim 37 (Amended). The method of claim 36 wherein the gene is the pig 
aMSHR gene and the at least one pair of suitable primers is as [defined in claim 35] selected 
from the group consisting of: 

aMSHR Forward Primer 1: (5'-TGT AAA ACG ACG GCC AGT RGT GCC TGG AGG TGT -

aMSHR Reverse Primer 5: (5'-CGC CCA GAT GGC CGC GAT GGA CCG-3');

aMSHR Forward Primer 2: (5'-CGG CCA TCT GGG CGG GCA GCG TGC-3')
aMSHR Reverse Primer 2: (5'-GGA AGG CGT AGA TGA GGG GGT CCA-3'); and


PATENT 

Attorney Docket No.: 064727.0108 



aMSHR Forward Primer 3: (5'-GCA CAT CGC CCG GCT CCA CAA GAC-3')
aMSHR Reverse Primer 3: (5'-GGG GCA GAG GAC GAC GAG GGA GAG-3').

Claim 38 (Amended). The method of claim 30 [or 31] wherein the gene is the pig
KIT or aMSHR gene and the determination step comprises RFLP analysis [is as defined in claim
28] in which pig genomic DNA is amplified using PCR and a pair of suitable primers.

Claim 39 (Amended). The method of [any one of claims 1-9, 11, 13-19, 21-25 
and 29-38] claim 1 wherein the animal product is selected from the group consisting of meat 
[(e.g., processed and/or canned meat)], egg, egg swab or washing, semen, wool [or] and leather. 

Claim 40 (Amended). The method of [any one of claims 1-9, 11, 13-19 and 21- 
39] claim 1 wherein the sample comprises genomic DNA, RNA or mitochondrial DNA. 

Claim 41 (Amended). The method of [any one of claims 1-9, 11, 13-19 and 21-
40] claim 1 wherein the animal is selected from the group consisting of [a] mammal [(e.g. pig,
cattle, dog, cat, horse, sheep, rodent or rabbit)], fish [(e.g. salmon or trout) or] and bird [(chicken
or turkey)].

Claim 43, line 1, delete "colour" and insert --color--.
Claim 44, line 1, delete "claim 42 or". 

Claim 45 (Amended). A kit as claimed in [any on of claims 42 to 44] claim 43 
comprising one or more reagents for carrying out at least one cycle of PCR together with at least 
one pair of suitable primers. 

Claim 46 (Amended). A kit as claimed in claim 45 wherein the [atleast] at least
one pair of suitable primers is selected from the group consisting of:

[aMSHR] aMSHR Forward Primer 1: (5'-TGT AAA ACG ACG GCC AGT RGT GCC TGG
AGG TGT CCA T-3')

[aMSHR] aMSHR Reverse Primer 5: (5'-CGC CCA GAT GGC CGC GAT GGA CCG-3'); [or] 

[aMSHR] aMSHR Forward Primer 2: (5'-CGG CCA TCT GGG CGG GCA GCG TGC-3') 
[aMSHR] aMSHR Reverse Primer 2: (5'-GGA AGG CGT AGA TGA GGG GGT CCA-3'); [or] 
and 

[aMSHR] aMSHR Forward Primer 3: (5'-GCA CAT CGC CCG GCT CCA CAA GAC-3') 
[aMSHR] aMSHR Reverse Primer 3: (5'-GGG GCA GAG GAC GAC GAG GGA GAG-3'). 

Claim 47 (Amended). A kit as claimed in [any one of claims 42 to 46] claim 42 
which comprises one or more reagents for RFLP analysis of pig nucleic acid. 

Please add the following new claims:

48. The method of claim 34 wherein said RFLP analysis comprises digesting pig
nucleic acid with a restriction enzyme selected from the group consisting of BstUI, HhaI, BspHI
and mixtures thereof.

49. The method of claim 39 wherein said meat is selected from the group
consisting of processed meat, canned meat, and mixtures thereof.

50. The method of claim 41 wherein said mammal is selected from the group 
consisting of pig, cattle, dog, cat, horse, sheep, rodent and rabbit, said fish is selected from the 
group consisting of salmon and trout, and said bird is selected from the group consisting of 
chicken and turkey.

REMARKS

Applicants believe that no new matter is introduced in the filing of this 
Preliminary Amendment. 

Applicants respectfully request that these amendments be entered in the subject 
application prior to examination. 



Respectfully submitted, 



Date: November 30, 1999 

Baker & Botts, L.L.P. 

The Warner, Suite 1300

1299 Pennsylvania Avenue, N.W. 

Washington, D.C. 20004-2400 

Tel.: (202)639-7700 

Fax: (202)639-7890

METHODS FOR ANALYZING ANIMAL PRODUCTS

The present invention relates to methods for analysing animals and their products. In
particular, the invention relates to methods for differentiating animal products on the
basis of breed origin, determining or testing the breed origin of an animal product and
for validating an animal product, as well as to kits for carrying out such methods. In
addition, the present invention provides methods for the determination of pig genotype
with respect to coat colour.

Introduction 

Animal breeds 

For thousands of years, selective pressure has been applied by humans in the course of
animal husbandry to produce livestock exhibiting certain desirable characteristics.
These characteristics have been selected to meet aesthetic, technical, ritual, social and
economic needs. The result has been the production of a large number of different
animal breeds.

The term "breed" is a term of art used to define a homogenous, subspecific group of
domestic livestock with definable and identifiable external characteristics that enable it
to be separated by visual appraisal from other similarly defined groups within the same
species. The term therefore defines a group of animals to which selective pressure has
been applied by humans to give rise to a uniform appearance that is inheritable and
distinctive with respect to other members of the species.

As breeds become established, their integrity is maintained by breed societies, 
herdbooks and pedigree records. 

Breed selection

Conventional breed selection methods are based on direct measurement of the
phenotype of an animal and/or its relatives. Thus, the implementation of breeding
schemes requires extensive phenotypic record keeping. For example, dairy herd
improvement programs in the United States and Western Europe relied in part on the
collection of individual records (milk yield and composition, type traits, health traits,
etc.) performed on a monthly basis for millions of cows. Likewise, breeding companies
carefully monitor their pig and poultry breeding stock for a whole range of phenotypic
measurements.

However, some important characteristics are not immediately apparent at the level of
the living animal. For example, many parameters of meat quality are determined by
subtle physiological or biochemical characteristics which are not readily apparent and
so cannot serve as the basis for efficient artificial selection.

Breeding for qualities of this type has relied in part upon selection for other (more
readily apparent) traits which are to some extent coinherited (linked or associated) with
the desirable characteristics. For example, in the pig industry lop ears have in the past
been associated with mothering ability and so have been used as a marker for this trait.

Conventional breed selection methods are limited by the fact that some phenotypes are
expressed only in one sex or at a specific developmental stage. Moreover, some
phenotypes are difficult and costly to measure. Indirect detection of such phenotypic
traits via DNA-based diagnosis (for use in marker-assisted selection or MAS) is
therefore seen as a desirable alternative to direct measurement of phenotypic parameters
(see Georges and Andersson (1996), Livestock genomics comes of age, Genome
Research, Vol. 6: 907-921). However, the gene structure-function relationships
underlying many of the desirable traits are often highly complex and not yet sufficiently
well-established to make such an approach feasible in practice.

Breed identification 

The definition of animal breeds is currently at a watershed. Whereas previously they
have been defined by overt physical characteristics and pedigree records, in the future
as new breeds are developed from specific breeding lines they will be defined by sets of
DNA markers. The work described herein allows not only the most accurate approach
to breed determination currently possible in a range of products but also allows the
integration in a common format of breed determinant information obtained through use
of the present invention with that which will be used in the future. The present
invention therefore allows not only the determination of source breed in the current
environment but also links this to the development of future breeds and their unique
identification.

It is generally recognized that the only definitive way to identify a particular animal as a
representative of a given breed is through its pedigree. Thus, despite the fundamental
importance of overt phenotypic traits in the breeding process and in the maintenance of
breed purity, those skilled in the art generally consider that breed identity cannot be
definitively characterized on the basis of visual inspection of such traits. By way of
example, the genetic factor causing the belt phenotype in pigs is dominant to the
non-belted form. Thus, a belted animal may result from an animal of a belted breed such as
Hampshire being crossed with a non-belted breed.

As stated in PIGS: A handbook to the breeds of the world, V. Porter, Helm Inf Ltd, ISBN
1-873403-17-8, 1993, page 16, "What is a Breed?":

Appearances can be deceptive: never judge a pig breed by its coat!

However, in many circumstances breed identification on the basis of direct evidence of
pedigree is difficult or impossible. Thus, in practice, so-called "breed markers" may be
used to determine breed identity.

The term "breed marker" is a term of art which defines a measurable characteristic
which on the basis of empirical data appears to be breed specific. Breed markers include
genotypic features such as DNA polymorphisms, chemical features such as protein and
water contents of meats, epigenetic/biochemical features (such as protein
polymorphisms), chromosome structure, gene copy number, DNA fingerprinting,
microsatellite analysis and RAPD DNA markers.

Other useful markers include breed determinants. The term "breed determinant" is used
herein to indicate an overt phenotypic characteristic which is used (at least in part) as
the basis of artificial selection during breeding programmes. It is used in
contradistinction to the term "breed marker", which (as explained above) is used herein
to define other characteristics which appear to be breed specific on the basis of
empirical data. The term "breed determinant gene" is used to indicate a gene which is
involved (at least in part) in the expression of the corresponding overt phenotypic
characteristic.

Some breed determinants (e.g. coat colour) have traditionally been used as breed
"trademarks", and so have long served as an indication of pedigree (and breed identity).
Other breed determinants that have also been selected for in breed development include
features such as ear carriage, face shape and general anatomical conformation. The
advantage of breed determinants relative to simple breed markers is the inseverable link
between the characteristics of the breed and the determinant.

Biochemical and genetic tests for breed identity

Many of the breed markers discussed above can be characterized using biochemical or
genetic tests. Such markers include genotypic features (e.g. DNA polymorphisms),
biochemical features (e.g. protein polymorphisms), chromosome structure, gene copy
number, DNA fingerprints, microsatellite patterns and RAPD DNA markers.

However, there are significant problems associated with such tests, as discussed below:

Tests based on the chemical composition of animal products (e.g. meat or seminal
plasma) may be compromised by the fact that the chemical profile varies between sites
in the animal (i.e. different muscles) and is affected by diet, age, sex and sample storage
conditions. Moreover, the results obtained are usually quantitative in nature, leading to
problems with interpretation and comparison between different test sites.

Tests based on protein polymorphisms are limited by the fact that the distribution of any
given protein is unlikely to be uniform, so that the protein of interest is absent in certain
tissues. Thus, a number of different polymorphism markers may be required to check
all products of interest for breed provenance. Moreover, such tests are based on
antibody assays, and a significant investment is also required to develop the reagents for
a specific antibody test.

Chromosome structure analyses are compromised by the high level of skill required for
cytogenetical methodology and interpretation and the elaborate precautions and care
required for sample preservation. Such markers are poorly applicable on anything but
materials derived from living or newly deceased animals.

Classical DNA fingerprinting is based upon regions of repeated DNA sequence that due
to their structure show a large degree of variation in length within a population. Such
regions are often present in a number of copies within the DNA of an individual, thus
increasing the potential for individual variation. By separating fragments of the total
DNA according to size and then defining the position (and so size) of the hypervariable
region using a specific probe, a fingerprint of a series of bands for a particular
individual can be obtained. A number of probes for hypervariable regions of DNA have
been examined in pigs (including M13 viral sequences and human minisatellite probes)
and it is claimed that specific bands were found in each breed.

Random Amplified Polymorphic DNA (RAPD) markers are based upon PCR
amplification of DNA fragments using primers of random sequence. Such reactions
generally give rise to a number of DNA fragments which can be characterised
according to size by gel electrophoresis. If the products of reactions based upon DNA
from different breeds are examined there is the possibility of finding certain DNA bands
which are breed specific. However, there is in most cases no direct link between the
alleles of such repeat series present and the features determining the actual nature of the
breed. This, combined with the hypervariable nature of these regions of DNA, results
in them rarely being breed specific (similar alleles being found in a number of different
breeds). As there is no link to the phenotype of the breed there is a greater risk that
cross specific alleles could exist or arise in a breed, whereas this is unlikely with breed
determinants as they define the phenotype itself. Given the large number of populations
of animals of specific breeds that exist, extensive research would have to be carried out
to exclude a DNA marker from breeds other than that with which it is claimed to be
linked.

However, a major drawback with this approach is that RAPD markers are considered to
be unreliable and found to be subject to variation between laboratories. Such problems
are exacerbated when samples of different types and history must be analyzed and
compared.

There is therefore a need for reliable breed markers which can be used as the basis for
rapid and inexpensive methods for identifying the breed provenance of various animal
products and for validating animal products (such as foodstuffs and semen for use in
breeding programmes).

It has now been recognized that breed determinants as hereinbefore defined (such as
coat colour) have unexpected advantages as breed identifiers or breed specific markers.
In particular, it has surprisingly been discovered that the use of overt phenotypic
characteristics as the basis for selection over long periods of time has led to particular
alleles becoming fixed in most breeds. Such breed markers can be used to provide
industry standard profiles for a particular breed that have application to all materials
derived from a particular species.

Thus, it has now been found that many breeds are in fact genetically homogenous with
respect to breed determinant genes (as hereinbefore defined), so that these genes may
serve as the basis of reliable breed-specific markers (contrary to the prejudice in the art
mentioned earlier regarding the utility of breed determinants per se, such as coat colour,
in breed identification).

Moreover, it has surprisingly been found that the nature of the breed determinant genes
(or alleles thereof) underlying any one breed determinant (such as coat colour) may be
highly polymorphic. Thus, variation in breed determinant genes and/or alleles between
different breeds may exist, notwithstanding the fact that the different breed determinant
genes/alleles may contribute to the expression of the same overt phenotypic
characteristic.

Prior to the present invention, it was assumed that the corresponding genetic
determinants would be insufficiently polymorphic to provide a useful basis for
distinguishing between breeds. For example, coat colour was known to be shared
among different breeds of pig and (as mentioned above) was therefore not regarded as a
good candidate for a breed specific marker. However, the present inventors have found
that the alleles underlying the coat phenotype in such breeds are in fact highly
polymorphic and often distinctive (and so useful as the basis for breed identification).

Similar considerations apply to other overt physical traits (breed determinants), which
may therefore be shared by different breeds while nevertheless associated with distinct
genes/alleles in each breed. An example of this is seen in cattle exhibiting the double
muscled phenotype. Work by Kambadur et al. (1997, Genome Research 7, 910-915)
and Grobet et al. (1997, Nature Genetics 17, 71-74) illustrates that the double muscled
phenotype of cattle is caused by mutations in the myostatin gene. However, in the
Belgian Blue and Asturiana breeds, this gene contains an 11 bp deletion whereas in the
Piedmontese breed a G to A transition is present. Thus, as with porcine coat colour, a
single selected characteristic is caused by a number of potential polymorphisms.
However, the nature of the arisal and selection history for such overt physical
characteristics leads to the fixation of particular alleles within the breeds contributing to
the breed specific profile of determinants.
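
The myostatin example can be pictured as a simple lookup from an observed variant to the breeds in which the text reports it. The following Python sketch is illustrative only: the allele labels and the function name are hypothetical, and the table holds just the two variants and three breeds named above, not a validated breed panel.

```python
# Illustrative lookup only. The keys are hypothetical labels for the two
# myostatin variants described in the text; the breed sets come from the text.
MYOSTATIN_VARIANTS = {
    "11bp_deletion": {"Belgian Blue", "Asturiana"},
    "G_to_A_transition": {"Piedmontese"},
}


def candidate_breeds(observed_variant):
    """Return, sorted, the breeds the text associates with a variant.

    An unknown variant yields an empty list rather than a guess.
    """
    return sorted(MYOSTATIN_VARIANTS.get(observed_variant, set()))


print(candidate_breeds("11bp_deletion"))
print(candidate_breeds("G_to_A_transition"))
```

A single determinant narrows the candidates but need not identify one breed, which is why the text speaks of a profile of several determinants.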

In the light of these findings, it has now been recognized that genetic analysis of breed
determinants (such as coat colour) provides an effective means for validating animal
products (e.g. foodstuffs) and may advantageously be incorporated into animal product
(e.g. food) processing lines to monitor and maintain product quality and into quality
control protocols in the food industry.

Coat colour 

Pig breeds show a variety of coat colours and these are often associated with
particular production characteristics. For example, white is the predominant coat
colour among European commercial breeds e.g. Large White and Landrace, and these
breeds are associated with larger litters and good mothering ability. However, there
are a number of commercially important coloured breeds, demonstrating a number of
colours and combinations. The Duroc, associated with meat tenderness, is red, the
Pietrain, a heavily muscled animal which produces a very lean carcass, is spotted, and
the Hampshire, also heavily muscled, is black with a white saddle over its shoulders.
In addition, there may be other useful local breeds which have traits of potential
commercial interest, and which are coloured. For example, the Chinese Meishan
breed has been imported into Europe and the US because of its very large litter size.
The European Wild Boar is brown when adult and striped when juvenile, and this
breed is utilised to satisfy consumer demand for traditional meat products. It is also
claimed that other local breeds or landraces are important because of their adaptation
to local environments, e.g. temperature, endemic diseases and local feedstuffs.

Coat colour is important to the pig industry for a number of reasons. Firstly, gross
variation in appearance (i.e. a range of coat colours) of pigs claimed to be genetically
consistent for traits other than coat colour can lead to questions about the consistency
and quality of the animals in the mind of pig-producers. Thus, the coat colour of the
pig is often used as a trademark of the breed and the breeders want to ensure that their
animals breed true for colour. For example, in several markets, local, traditional,
coloured breeds are marketed for their meat quality or in terms of the production
system used to rear them. However, this is not a trivial task since the coat colour is
controlled by a number of genes. The inheritance is also complicated by the presence
of dominance and interaction between genes. There is also an application in the
assessment of the purity of the genetics of traditional breeds used as the basis for
modern synthetic lines and the confirmation of the derivation of the latter.

Secondly, in a number of markets there is a preference for white skinned meat. This
is due to the fact that pork is often marketed with the skin still attached, and skin from
coloured pigs, even if dehaired, can still exhibit colour, which can lead to a negative
perception by the consumer, partly because the surface of the meat may appear to be
spotted by mould. It is therefore necessary in these markets to remove the skin from
such carcasses, entailing additional cost. For example, in the US, coloured carcasses
are associated with approximately 1% skin defects requiring dehairing and skinning to
remove pigment. As a result of this, coloured pig carcasses are generally discounted.

One example of the problem concerns the presence of black pigmented spots
occurring in production animals that are crossbreds between a white and a pigmented
line. This may occur because the dominant white gene inherited from the white breed
is not always fully dominant in the heterozygous condition which occurs in this cross.
A possible solution to this problem would be to ensure that the production animals
are homozygous for the recessive red allele present in breeds such as the Duroc. In
this case the pigmented spots would be red instead of black and much less
conspicuous. To achieve this one needs to breed the recessive red allele to
homozygosity in both the white and pigmented line used for cross-breeding.
However, this would be very difficult using phenotypic selection as selection for a red
background colour in a white line could only be accomplished with very expensive
progeny testing schemes.

In addition, pig breeders would like to be in a position to ensure consistency
in breeding populations. Breeders may wish to ensure that progeny produced by
breeding crosses were always white. Alternatively, a breeder of Duroc or Hampshire
pigs may wish to ensure that breeding crosses always produced the characteristic
Duroc or Hampshire colouring. Traditional animal breeding practices have, in the
past, been used to attempt to eliminate untypical colour from pig lines. For example,
purebred breeders must submit potential boars for progeny testing in order to
demonstrate that they are suitable for inclusion in the breeding herd. This procedure
incurs significant cost, including the substantial delay to confirm sufficient matings
and progeny have been produced before the animal can be used commercially.

Therefore, selection based on a diagnostic DNA test for mutations in coat colour
genes would be a major advance compared with phenotypic selection.

Coat colour is determined by the action of a number of different gene loci. For
example, the gene determining whether a pig is white or coloured is designated I (for
inhibition of coat colour). The version of the gene preventing the expression of any
colour (I) is dominant to that which allows colour to develop (i). Traditional selection
for white animals has reduced the frequency of i, but it still remains in the population of
white heterozygous carrier animals. Recently, a number of structural differences in the
alleles of the KIT gene were identified and found to be involved with this aspect of coat
colour determination, which allowed the development of methods of distinguishing
between alleles at this locus.



However, animals which carry two copies of the recessive allele, i, at this locus have
non-white coat colours (Johansson-Moller et al., Mamm. Genome, 7:822-830 (1996);
WO-A-97/05278, the disclosure of which is incorporated herein by reference). Pigs of
this type can be all one colour, such as the Duroc (which is red), or have combinations
of colours (particularly spotted or striped or banded patterns, such as the Pietrain and
Hampshire, respectively). Many other combinations are possible and are observed (see
the table below):

Genotype    Colour
I/I         White
I/i         White
i/i         Coloured
///         White with coloured spots
///         White
P/i         White with coloured patches

The non-white colour in such animals may be varying shades of red or black. The type
of colour expressed is determined by the action of a second gene which is designated E
(for extension of coat colour). Based on the literature, animals which contain the E
version of the gene are completely black, and this version of the gene is dominant to
that which results in red coat colour (e). Patched or spotted animals, such as the
Pietrain breed, contain a third version of the gene designated EF. This version of the
gene is dominant to e but not to E. For example, black animals may have the
genotype iiEe, iiEEF or iiEE. A black sow and a black boar which were both
heterozygous at the E locus and which were of the genotype iiEe would produce both
black and red piglets in the ratio 3:1. The black piglets would be iiEE or iiEe and the
red piglets would be iiee.
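
The 3:1 expectation can be checked by enumerating the equally likely gamete combinations. The following is only an illustrative sketch: it assumes both parents are i/i, applies the dominance order E > EF > e given in this document, and uses hypothetical phenotype labels and function names.

```python
from collections import Counter
from itertools import product

# Dominance order at the extension locus as stated in the text: E > EF > e.
DOMINANCE = ["E", "EF", "e"]
# Phenotype labels are illustrative shorthand, not terms defined by the source.
COLOUR = {"E": "black", "EF": "black spotted", "e": "red"}


def phenotype(genotype):
    """Return the colour set by the most dominant allele of an allele pair."""
    top = min(genotype, key=DOMINANCE.index)
    return COLOUR[top]


def cross(parent1, parent2):
    """Count offspring phenotypes over all equally likely gamete combinations."""
    return Counter(phenotype(pair) for pair in product(parent1, parent2))


# Two black parents, both heterozygous E/e (and both assumed to be i/i).
ratios = cross(("E", "e"), ("E", "e"))
print(ratios)
```

Three of the four combinations carry at least one E allele and are black, and the single e/e combination is red.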

The density and coverage of coat colour and the position of bands of white are
determined by additional loci, one of which, the belt locus, is discussed later in this
application. For example, the Hampshire breed is background black with a white band
across its shoulders; the width of this band may vary, but the colour should be
black. There is evidence that some Hampshire animals are derived from herds that
have been crossed previously with red breeds, such as the Duroc. In this situation, the
red version of the gene can be maintained silently in the heterozygous state. When
two heterozygotes are crossed, 25% of the offspring will contain red. In some cases
such pigs will have the appearance of the Duroc breed, being solid red; in
other cases, the animals will have the white band inherited from the Hampshire and
have the appearance of red Hampshires. It is the presence of the atypical coat colour
rather than the pattern that is important in this situation.

The extension locus is known in other breeds of domestic animals, such as the horse,
where e is associated with chestnut colour (Adalsteinsson, J. Hered. 65:15-20 (1974)),
cattle (Klungland et al., Mammalian Genome 6:636-639 (1995)), the fox
(Adalsteinsson, J. Hered. 78:15-20 (1987)) and the mouse (Jackson, Ann. Rev. Genet.
28:189-217 (1994)). The extension locus encodes the alpha melanocyte-stimulating
hormone receptor (aMSHR). It has been shown that recessive alleles at this locus do
not express a functional aMSH receptor (Robbins et al., Cell 72:827-834 (1993);
Klungland et al., Mammalian Genome 6:636-639 (1995)) and these workers have identified
mutations in the sequence of the aMSHR gene in these species associated with
different coat colours.

Classical segregation analyses have identified a minimum of three alleles at the pig
extension locus: E for uniform black, EF for black spotting and e for uniform red
(Ollivier and Sellier, Ann. Genet. Sel. Anim., 14:481-544 (1982)). The dominance
relationship among the three alleles is as follows: E>EF>e. We have now found that
these coat colour variations are associated with sequence polymorphism in the
aMSHR gene in the pig. We have analysed the DNA sequence of this gene using
samples from the following breeds with different coat colour: Wild Boar, which is
wild type coloured; Meishan and Hampshire, which carry alleles for uniform black
(E); Pietrain and Large White, which carry alleles for black spotting (EF); and Duroc,
which is uniform red (e). In Large White the patches or spots of colour that might be
expected due to the presence of the EF allele are hidden, as this breed also carries the
dominant white gene which prevents any expression of colour. Five different aMSHR
sequences were obtained: one from the Wild Boar, one from Meishan, one from Duroc,
one from Hampshire, and one found in Pietrain and Large White. We have designated
the allele found in the Wild Boar as and assume that the presence of this allele is
necessary for the expression of the wild type colour. The E alleles for uniform black
carried by Meishan and Hampshire pigs were associated with different aMSHR
sequences. We have denoted these two alleles E^ and respectively. The DNA
sequence associated with the allele for black spotting found in the Pietrain was
denoted EF. The similarity of the EF and E^ alleles suggests that they are derived from
a common origin. The sequence differences presented here can be used as the basis of
methods and kits to determine the genotype of pigs in relation to coat colour.

Alternatively, alleles of linked markers, such as microsatellite or AFLP markers,
found to be in linkage disequilibrium with these alleles could be used to predict colour
genotype. In conclusion, we have found five different aMSHR sequences associated
with five different extension alleles i.e. 5"^, BF and 

Except for the 2 base pair insertion at the 5' end of the EF allele and the 1 bp deletion
in the 3' untranslated region of the E^ allele, the DNA sequence differences identified in
the aMSHR gene are single base pair changes. Some of these are silent; however, a
number lead to changes in the amino acid sequence of the aMSHR protein. For
example, the differences between e and the other alleles are two missense mutations
in the coding sequence of the aMSHR gene. Importantly, the differences in the pig
gene are different from those found in other species. The cattle and mouse e mutations
are one base pair deletions (Robbins et al., Cell 72:827-834 (1993); Klungland et al.,
Mammalian Genome 6:636-639 (1995); Joerg et al., Genome 7:317-318 (1996)),
whilst the mutations identified here include a missense mutation (G727 changed to A)
in a region which is conserved among human, mouse, cattle and horse gene sequences
(Wikberg et al., WO 94/04674 (1994); Valverde et al., Nature Genet. 11:328-330
(1995); Robbins et al., Cell 72:827-834 (1993); Klungland et al., Mammalian
Genome 6:636-639 (1995); Joerg et al., Genome 7:317-318 (1996); Marklund et al.,
Mamm. Genome 7:895-899 (1996)). The EF allele has a dinucleotide insertion in the
5' end of the gene after nucleotide position 66 of the Wild Boar sequence, which leads
to the creation of a stop codon further into the gene, resulting in a predicted mutant
polypeptide of only 54 amino acids. Finally, the Meishan allele (E^) shows four
amino acid changes in the protein. Two of these differences are in the same region of
the gene which is altered in cattle.
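
How a two-base insertion can truncate a protein may be easier to see with a small worked example. The sketch below is purely illustrative: the DNA sequences are short hypothetical toy sequences, not the porcine aMSHR sequence, and the function name is an assumption. It translates a reading frame with the standard genetic code and shows that a dinucleotide insertion shifts the frame so that a stop codon is met early.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order ("*" marks a stop codon).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(dna):
    """Translate from the first base until a stop codon or the end of the sequence."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# Hypothetical toy sequences (NOT the pig gene): the mutant carries a 2 bp
# insertion ("CC") after position 6, which shifts the downstream reading frame.
wild_type = "ATGGCTGACCTGAAAGGCTAA"
mutant = wild_type[:6] + "CC" + wild_type[6:]

print(translate(wild_type))  # full-length toy peptide
print(translate(mutant))     # shorter peptide ending at a premature stop
```

In the toy example the shifted frame meets a stop codon after four amino acids instead of six, which mirrors in miniature the truncation described above.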

The colours of a series of pig breeds, the classical genotypes for I and E and the
determined genotypes for E based on sequencing and testing studies are shown in the
table below:



Breed         I locus   E locus   Colour
Hampshire     i/i       t/t"      Black with white belt
Large White   I/I       EF/EF     White
Landrace      I/I       EF/EF     White
Pietrain      i/i       EF/EF     White with black patches
Berkshire     i/i       EF/EF     Black with white points
Meishan       i/i       ET/E^     Black or black with white points
Duroc         i/i       e/e       Red
Wild Boar     i/i       E^/E^     Brown (banded hair)



Thus, it is possible to distinguish between the alleles of E^, ET, Ef", EF and e and so
determine the genotype of individual pigs (or the genetic provenance of products
derived therefrom) with respect to non-white coat colour. Interestingly, the white
breeds that have been examined all appear to be fixed for allele EF at the E locus.
There is considered to be potentially some modifying effect of the E locus on the
phenotype conferred by the I locus. While the basis of this is not established, the fixing
of EF in these lines illustrates the subtle effects that selection for breed characteristics
has on loci involved with coat colour, thus providing more determinants among such loci
than might be expected.

Associations can be determined between extension locus genotype and linked markers,
e.g. microsatellite sequences which are linked to the gene. A number of microsatellite
markers have been located to the region of porcine chromosome 6 to which the
aMSHR gene has been mapped.

A number of pig breeds characteristically show the belt phenotype, consisting of a
continuous white belt over the shoulders and white fore legs. Examples of breeds
demonstrating this characteristic are the British Saddleback (derived from the Wessex
and Essex breeds) and the Hampshire, which show a white belt upon a black
background, and the Bavarian Landschwein, characterised by a white belt upon a red
background. The characteristic is controlled by a dominantly acting locus, Belt,
designated Be, for which there are thought to be two alleles (Legault 1997 in The
Genetics of the Pig, Ed. Rothschild M.F. and Ruvinsky A., Publ. CAB International;
Ollivier and Sellier, Ann. Genet. Sel. Anim., 14:481-544 (1982)): Be, giving rise to a
belt, and be, which in the homozygous form leads to the absence of a belt. The
heterozygous animal Be/be carries a belt, but in this genotype the belt is generally
narrower in character.

To identify the actual genetic basis of the belted and non-belted phenotypes, studies were
carried out using animals from a Pietrain x (Pietrain x Hampshire) cross. The Pietrain is
be/be while the Hampshire is Be/Be. Thus the F1 generation all have the genotype
Be/be. Further crossing of the F1 back to the Pietrain (be/be) leads to the segregation of
the Be allele between offspring, giving rise to Be/be animals showing belts and be/be
non-belted offspring. A correlation was then established between the inheritance of the
belted condition and certain microsatellite markers within these pedigrees. This work
surprisingly identified the actual gene involved as the KIT gene, also described above as
involved in dominant white. Further analysis showed correlation of the phenotype in
this pedigree with a polymorphism at KIT nucleotide 2678, with a C or T occurring at
this position. The presence of a C creates a restriction site for AciI which is absent
when a T is present. Based upon these unexpected findings, a number of approaches
can be taken to the determination of the genotype for an animal at the belt locus using
either single nucleotide polymorphisms or linked markers including microsatellites or
other single nucleotide polymorphisms. Thus animals can be genotyped by a number
of approaches to determine their genetic status for this particular overt characteristic.
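
Because a C at position 2678 creates a restriction site that a T abolishes, the genotype can be read from the pattern of fragments seen after digesting an amplified product. The sketch below illustrates that logic only: the fragment sizes (a 200 bp product cut into 120 bp and 80 bp) are hypothetical placeholders, not values from this document, and the function name is an assumption.

```python
def call_snp_genotype(bands, uncut_size, cut_sizes):
    """Infer the C/T genotype from fragment sizes observed after digestion.

    bands      -- fragment sizes (bp) observed on the gel
    uncut_size -- size of the undigested product (T allele, no site)
    cut_sizes  -- sizes produced when the site is present (C allele)
    """
    observed = set(bands)
    has_uncut = uncut_size in observed
    has_cut = set(cut_sizes) <= observed
    if has_cut and has_uncut:
        return "C/T"   # both alleles present
    if has_cut:
        return "C/C"   # every copy was digested
    if has_uncut:
        return "T/T"   # no copy was digested
    return "no call"   # unexpected pattern, repeat the assay


# Hypothetical sizes: a 200 bp product cut into 120 bp + 80 bp fragments.
UNCUT, CUT = 200, (120, 80)
print(call_snp_genotype([200, 120, 80], UNCUT, CUT))
```

A real assay would also need size tolerances and a control for incomplete digestion, both of which are omitted here.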

Summary of the invention

According to the present invention there is provided a method for differentiating
animals and animal products on the basis of breed origin, for determining or testing the
breed origin of an animal product or for validating an animal product, wherein the
method comprises the steps of: (i) providing a sample of the animal product; and (ii)
analyzing the allele(s) of one or more breed determinant genes present in the sample.

As explained above, the breed determinant is an overt phenotypic trait. As used herein,
an overt phenotypic trait is one which can be visually recognized.

Differentiation of animal products on the basis of breed origin involves the partition of
members of a class of different animal products into a number of different products
sharing the same breed origin. It does not necessarily imply identification of the nature
of the breed source. Animal product differentiation on this kind of basis may be
sufficient where the consistency of source of animal products must be monitored (but its
actual breed provenance is not important).

In contrast, determination of the breed origin of an animal product implies identification
of the breed source, while testing the breed origin implies analysis sufficient to
determine whether a breed source other than that desired has been used (without
necessarily identifying such other breed sources in cases where they are indicated).

Validating an animal product implies confirming that it meets stipulated specifications
as to breed provenance. Such validation may involve differentiation, determination
and/or testing, depending on the circumstances under which the analysis is performed
and the nature and extent of ancillary data which may be available.
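
The distinction drawn above between differentiation, determination and testing can be made concrete with a small sketch. It is illustrative only: the reference genotypes are the E-locus entries from the breed table earlier in this document that are legible in the source, and the function names and sample identifiers are hypothetical.

```python
from collections import defaultdict

# E-locus genotypes from the breed table in this document; only rows whose
# genotype is legible in the source are included (an assumption of this sketch).
REFERENCE = {
    "Large White": "EF/EF",
    "Landrace": "EF/EF",
    "Pietrain": "EF/EF",
    "Berkshire": "EF/EF",
    "Duroc": "e/e",
}


def differentiate(samples):
    """Partition sample IDs into groups sharing a genotype, without naming breeds."""
    groups = defaultdict(list)
    for sample_id, genotype in samples.items():
        groups[genotype].append(sample_id)
    return dict(groups)


def determine(genotype):
    """List every reference breed compatible with the observed genotype."""
    return sorted(b for b, g in REFERENCE.items() if g == genotype)


def check_origin(genotype, claimed_breed):
    """Return True if the genotype is consistent with the claimed breed."""
    return REFERENCE[claimed_breed] == genotype


samples = {"s1": "e/e", "s2": "EF/EF", "s3": "e/e"}
print(differentiate(samples))
print(determine("EF/EF"))
print(check_origin("EF/EF", "Duroc"))
```

The EF/EF result is compatible with several breeds, which illustrates why a single locus may suffice for testing a claim yet be insufficient for determination, and why genotyping an additional locus can be needed.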

The sample for use in the invention may be in any convenient form. In many cases, the
sample will be a sample of a food (e.g. meat product). For most applications, the
sample is pre-treated (e.g. extracted, purified and/or fractionated) in such a way as to
make the alleles of a breed determinant gene or genes available for analysis (either at
the level of nucleic acids (such as RNA or DNA) and/or proteins). The sample is
preferably a nucleic acid sample, in which case the analysing step (ii) comprises DNA
or RNA analysis. Alternatively, the sample may be a protein sample (where the nature
of the protein reflects a breed determinant allele), in which case the analysing step (ii)
comprises protein analysis.

The breed determinant of the invention may be a monogenic or polygenic trait.
Monogenic traits are preferred, since the genes conferring such traits are relatively
easily identified and analyzed. However, in some cases it may be useful to analyze the
alleles of polygenic traits (i.e. traits which are controlled by a plurality of genes), since
the underlying allele polymorphism is often greater in such cases (so increasing the
potential for breed differentiation).

Typically, overt phenotypic traits are those traits which have been used as the basis for
artificial selection during the breeding programme. The overt phenotypic trait is
preferably a morphological, physiological or behavioural trait.

The overt phenotypic trait may vary qualitatively or quantitatively between breeds.
Preferred are traits which vary qualitatively between breeds, since such traits are often
reflected by qualitative differences in the alleles of the corresponding breed determinant
gene(s). In such cases, analysis yields relatively robust positive-negative results, which
are easily interpreted and compared between testing stations/laboratories.

The breed determinant gene analysed in step (ii) may be any suitable breed determinant
gene. Such genes may be identified and analysed by methods well known in the art
using routine trial and error. Preferably, they are selected from any of a coat colour,
pattern, texture, density or length gene; an ear aspect gene; a double muscling gene; a
horn morphology gene; a tusk morphology gene; an eye colour gene; a plumage gene; a
beak colour/morphology gene; a vocalization (e.g. barking) gene; a comb or wattle
gene; and/or a gene controlling display behaviour.

In preferred embodiments, the breed determinant gene is the KIT or aMSHR coat
colour gene (for example, the pig KIT and/or aMSHR gene).

The analysis step (ii) may comprise any of a wide range of known nucleic acid/protein
analytical techniques. The nature of the analytical technique selected is not critical to
the practice of the invention, and those skilled in the art can readily determine the
appropriate technique according to the circumstances in which the analysis is to be
conducted and the type of data required.

Preferably, the analysis step (ii) comprises selectively amplifying a specific fragment of
nucleic acid (e.g. by PCR), testing for the presence of one or more restriction
endonuclease sites within the breed determinant gene(s) (e.g. restriction fragment length
polymorphism (RFLP) analysis), determining the nucleotide sequence of all or a portion
of the breed determinant gene(s), probing the nucleic acid sample with an allele-specific
DNA or RNA probe, or carrying out one or more PCR amplification cycles of the
nucleic acid sample using at least one pair of suitable primers and then carrying out
RFLP analysis on the amplified nucleic acid so obtained.

Alternatively, the analysis step (ii) comprises probing the protein sample with an
antibody (e.g. a monoclonal antibody) specific for an allele-specific epitope,
electrophoretic analysis, chromatographic analysis, amino-acid sequence analysis,
proteolytic cleavage analysis or epitope mapping. For example, the EF allele might be
distinguished by any method capable of detecting an alteration in the size of the
encoded protein.

In particularly preferred embodiments, the analysis step (ii) comprises determining the
nucleotide sequence of the KIT and/or aMSHR gene or the amino acid sequence of the
KIT and/or aMSHR protein. Here, the analysis may comprise establishing the presence
or absence of at least one mutation in the KIT and/or aMSHR gene. Any method for
identifying the presence of the specific sequence change may be used, including for
example single-strand conformation polymorphism (SSCP) analysis, ligase chain
reaction, mutagenically separated PCR, RFLP analysis, heteroduplex analysis,
denaturing gradient gel electrophoresis, temperature gradient electrophoresis, DNA
sequence analysis and non-gel based systems such as TaqMan™ (Perkin-Elmer).

In the TaqMan™ system, oligonucleotide PCR primers are designed that flank the
mutation in question and allow PCR amplification of the region. A third
oligonucleotide probe is then designed to hybridize to the region containing the base
subject to change between different alleles of the gene. This probe is labelled with
fluorescent dyes at both the 5' and 3' ends. These dyes are chosen such that while in this
proximity to each other the fluorescence of one of them is quenched by the other and
cannot be detected. Extension by Taq DNA polymerase from the PCR primer
positioned 5' on the template relative to the probe leads to the cleavage of the dye
attached to the 5' end of the annealed probe through the 5' nuclease activity of the Taq
DNA polymerase. This removes the quenching effect, allowing detection of the
fluorescence from the dye at the 3' end of the probe. The discrimination between
different DNA sequences arises through the fact that if the hybridization of the probe to
the template molecule is not complete (i.e. there is mismatch of some form), then
cleavage of the dye does not take place. Thus only if the nucleotide sequence of the
oligonucleotide probe is completely complementary to the template molecule to which
it is bound will quenching be removed. A reaction mix can contain two different probe
sequences, each designed against different alleles that might be present, thus allowing the
detection of both alleles in one reaction.

Although the TaqMan™ system is currently capable of distinguishing only two alleles,
labelled probe primer sets could be developed in which the probes for certain target
allele(s) are labelled with a different fluorescent dye from non-target alleles. For
example, if one wished to confirm that a group of Duroc breed pigs carried only allele e,
one could have a probe present capable of detecting this allele labelled with one
fluorescent dye and probes capable of detecting all the other alleles labelled with the
second dye. Thus one would detect the presence of any non-Duroc type alleles at this
locus. Such probe sets could be designed and labelled according to the needs of the
experiment.
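
The two-dye readout described above reduces to a simple decision rule. The sketch below is an assumption-laden illustration rather than an instrument protocol: the signal values, the threshold and the function names are hypothetical, and real assays would calibrate thresholds against controls.

```python
def call_two_probe(signal_a, signal_b, threshold=1.0):
    """Call a genotype from the fluorescence released by two allele-specific probes."""
    a_present = signal_a >= threshold
    b_present = signal_b >= threshold
    if a_present and b_present:
        return "A/B"
    if a_present:
        return "A/A"
    if b_present:
        return "B/B"
    return "no call"


def screen_for_non_target(target_signal, pooled_other_signal, threshold=1.0):
    """Pooled design: one dye reports the target allele, the second dye reports
    every other allele. Any second-dye signal flags a non-target allele."""
    if pooled_other_signal >= threshold:
        return "non-target allele present"
    if target_signal >= threshold:
        return "target allele only"
    return "no call"


# Hypothetical readings (arbitrary units) for a Duroc sample screened for allele e.
print(call_two_probe(2.4, 0.1))
print(screen_for_non_target(2.4, 0.1))
print(screen_for_non_target(1.8, 1.6))
```

The pooled design trades allele identity for throughput: it reports that some non-target allele is present without saying which one.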

The analysis step (ii) may further comprise determining the association between one or
more microsatellite marker alleles linked to the KIT and/or aMSHR gene and
particular alleles of the KIT and/or aMSHR gene.

Alternatively, the analysis step (ii) may be based on the identification of microsatellite
markers present in the nucleic acid sample.

The analysis step (ii) preferably comprises: (a) determining the association between one
or more microsatellite marker alleles linked to the KIT and/or aMSHR gene and
particular alleles of the KIT and/or aMSHR gene; and (b) determining which microsatellite
marker allele or alleles are present in the nucleic acid sample.
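
The two-part procedure, first learning which marker allele travels with which gene allele and then reading the marker in a new sample, can be sketched as follows. This is a minimal illustration under stated assumptions: the marker allele sizes and reference animals are invented for the example, the function names are hypothetical, and a simple majority rule stands in for a proper linkage disequilibrium analysis.

```python
from collections import Counter, defaultdict


def learn_association(reference_animals):
    """Map each marker allele to the gene allele it most often accompanies.

    reference_animals -- iterable of (marker_allele, gene_allele) pairs taken
                         from animals whose gene allele is already known.
    """
    counts = defaultdict(Counter)
    for marker_allele, gene_allele in reference_animals:
        counts[marker_allele][gene_allele] += 1
    return {m: c.most_common(1)[0][0] for m, c in counts.items()}


def predict(marker_alleles, association):
    """Predict gene alleles for a sample; unseen marker alleles yield None."""
    return [association.get(m) for m in marker_alleles]


# Hypothetical microsatellite allele sizes (bp) paired with known E-locus alleles.
reference = [(152, "e"), (152, "e"), (152, "EF"), (160, "EF"), (160, "EF")]
association = learn_association(reference)
print(predict([152, 160], association))
```

Because the association is statistical, the 152 bp marker allele in this toy data set predicts e even though one reference animal carried it with EF, so predictions of this kind are probabilistic rather than definitive.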

The analysis step (ii) preferably further comprises the step of determining the genotype
of at least one additional locus, for example an additional breed determinant (e.g. coat
colour) locus. Particularly preferred as an additional locus is the KIT gene locus (e.g. the
pig KIT gene locus).

The analysis step (ii) preferably comprises PCR using at least one pair of suitable
primers. In the case where the gene is the pig aMSHR gene, the at least one pair of
suitable primers is:

aMSHR Forward Primer 1: (5'-TGT AAA ACG ACG GCC AGT RGT GCC TGG
AGG TGT-3')
aMSHR Reverse Primer 1: (5'-CGC CCA GAT GGC CGC GAT GGA CCG-3'); or
aMSHR Forward Primer 2: (5'-CGG CCA TCT GGG CGG GCA GCG TGC-3')
aMSHR Reverse Primer 2: (5'-GGA AGG CGT AGA TGA GGG GGT CCA-3'); or
aMSHR Forward Primer 3: (5'-GCA CAT CGC CCG GCT CCA CAA GAC-3')
aMSHR Reverse Primer 3: (5'-GGG GCA GAG GAC GAC GAG GGA GAG-3')

The analysis step (ii) may also comprise restriction fragment length polymorphism
(RFLP) analysis, for example involving digesting the pig nucleic acid with one or more
of the restriction enzymes BstUI, HhaI and/or BspHI. In cases where the gene is the pig
aMSHR gene, this analysis may involve identification of a polymorphism at any of the
nucleotide positions shown to be polymorphic, including 283, 305, 363, 370, 491, 727,
729, 1162 or between nucleotide positions 60 and 70 or between nucleotide
positions 1005 and 1010 of the pig aMSHR gene.

The analysis step (ii) may involve carrying out one or more PCR amplification cycles of
the nucleic acid sample using at least one pair of suitable primers and then carrying out
RFLP analysis on the amplified nucleic acid so obtained to determine the KIT or
aMSHR genotype of the pig. Here, when the gene is the pig aMSHR gene the at least
one pair of suitable primers is as defined above.

The animal product preferably comprises or consists of meat (e.g. processed and/or
canned meat), egg, egg swab or washing, semen, blood, serum, sputum, wool, biopsy
sample or leather. It may comprise genomic DNA, RNA or mitochondrial DNA.

The animal is preferably a mammal (e.g. pig, cattle, dog, cat, horse, sheep, rodent or
rabbit), fish (e.g. salmon or trout) or bird (e.g. chicken or turkey).

The invention may be used with extremely small samples and can be used to screen
large numbers of samples quickly and inexpensively. The invention may be adapted to
yield absolute results, and quantification is not essential. Moreover, only small
fragments of nucleic acid are required, and the same tests can be used on the majority of
animal products.

Applications

The invention finds application in a number of areas. For example, certain breeds are
considered to yield meat of higher eating quality, and a number of retailers now market
products which claim to be derived from specific or traditional breeds (for example,
Wild Boar crosses). The invention enables consumer organisations to validate these
claims and also permits retailers to monitor the quality of the products with which they
are being supplied (i.e. perform product validation). The invention finds particular
application in validation studies carried out and used by retailers to support consumer
confidence, since the linkage between a genetic marker and an overt physical feature is
more readily grasped by the lay person than the concept of breed specific markers. This
makes the use of such breed determinants attractive and also offers marketing
opportunities for retailers to underpin validation schemes.

There are also a number of reports of breed influences on the quality of hams produced
by various meat processing techniques. For example, in one report hams from three
different pig breeds were reliably classified on the basis of sensory descriptors of
marbling, saltiness and dry cure flavour. The breed identification processes of the
invention enable producers to validate raw materials as part of quality control.

The ability to enforce and validate raw material source uniformity also yields improved
process control, lower costs and greater product consistency, since it has now been
found that heterogeneity in chemical composition of products from different breeds is
an important factor in flavour profile variation and there may also be differences in the
functionality of other meat components between breeds.

The invention also finds utility in the maintenance of stock purity by animal (e.g. pig)
breeders. The small size of traditional breed populations means that the maintenance of
a gene pool of sufficient size to avoid the effects of inbreeding requires the importation
and movement of stock between separate populations. A risk of genetic contamination
is associated with such movements, and the invention may be used to reduce or
eliminate these risks. The maintenance of biodiversity, and of the rare breeds providing the
reservoir for this diversity, creates an increasing need for breed identifiers. There is, for
example, a problem for breeders of the British Saddleback. Certain bloodlines of this
breed carry a higher frequency of the be allele of the belt locus, which can result in the
production of belt-less animals which do not reach the required breed standard and
decrease the value not only of that individual animal but of the whole litter. The ability
to select against this allele when new bloodlines are introduced to an existing
population would enable breeders to increase the genetic diversity without the risk of
lowering the relative standard of the particular population to that of the breed in general.

The invention may also be used as part of a breeding programme to confirm particular
crosses. This may be of enormous value in the establishment of pyramid breeding
schemes. Particular breed characteristics such as coat colour, body shape and ear aspect
are often altered in such crosses, yet there is a need to be able to confirm the presence of
genetics of the desired parents.

Such visible breed characteristics for the visible confirmation of crosses are also absent
in the use of artificial insemination, where semen may be supplied from pigs in distant
geographical locations.

In addition, the skilled person will appreciate that, based on the information described
herein, it is possible to provide tests for determining pig genotype with respect to coat
colour. Thus, the present invention also provides a method of determining the coat
colour genotype of a pig which comprises:

(i) obtaining a sample of pig nucleic acid; and

(ii) analysing the nucleic acid obtained in (i) to determine which allele or
alleles of the aMSHR gene are present.

In one embodiment of this aspect of the invention the determination in step (ii) is
carried out by determining the nucleotide sequence of the aMSHR gene and, in
particular, is based on determining which missense, insertion or deletion mutation is
present in the coding region of the gene.



In another embodiment one could first determine the association between
microsatellite or other linked marker alleles linked to the aMSHR gene and particular
alleles of the aMSHR gene. Thus, the determination in step (ii) would be based on
identification of microsatellite marker alleles present in the nucleic acid sample.

In a further aspect, therefore, the present invention provides a method of determining
the coat colour genotype of a pig which comprises:

(i) determining the association between one or more microsatellite or
other linked marker alleles linked to the aMSHR gene and particular
alleles of the aMSHR gene;

(ii) obtaining a sample of pig nucleic acid; and

(iii) analysing the nucleic acid obtained in (ii) to determine which
microsatellite or other linked marker allele or alleles are present.

The determination of the alleles at the extension locus will indicate the background
colour of the animal and in some cases the pattern of mixed colouration, i.e. spotting,
but will not necessarily determine the coat colour of resulting progeny. This will be
dependent on the genotype at other loci such as the dominant white locus, I. The
genotype at the I locus can be determined separately as described in WO-A-97/05278.
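
The interaction between the two loci can be summarised as a two-step rule: a dominant white allele masks colour, and otherwise the most dominant extension allele sets the background colour. The sketch below encodes only what this document states (I dominant to i; E > EF > e); the colour labels and function name are illustrative assumptions, and other loci such as belt are deliberately ignored.

```python
# Dominance order at the extension locus as stated in the text: E > EF > e.
E_DOMINANCE = ["E", "EF", "e"]
# Illustrative colour labels; belt and other modifying loci are not modelled.
E_COLOUR = {"E": "black", "EF": "black patches or spots", "e": "red"}


def predict_colour(i_genotype, e_genotype):
    """Predict coat colour from (allele, allele) pairs at the I and E loci."""
    if "I" in i_genotype:
        # Dominant white prevents expression of any colour.
        return "white"
    top = min(e_genotype, key=E_DOMINANCE.index)
    return E_COLOUR[top]


print(predict_colour(("I", "I"), ("EF", "EF")))  # e.g. Large White
print(predict_colour(("i", "i"), ("EF", "EF")))  # e.g. Pietrain
print(predict_colour(("i", "i"), ("e", "e")))    # e.g. Duroc
```

This also shows why the extension genotype of a white animal matters for breeding: the masked allele reappears in coloured progeny that do not inherit I.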

Thus, suitably, the methods as described above may further comprise the step:

(iii)/(iv) determining the genotype of additional coat colour loci.

An example of such an additional coat colour locus is the belt locus.

In a preferred method PCR is carried out using primers that amplify a region of the KIT
gene containing nucleotide 2678. An example of a suitable pair of primers is:

forward primer
LA93 5'-GAGCAGCCCCTACCCCGGAATGCCAGTTGA-3'
and the reverse primer
KIT56 5'-CTTTAAAACAGAACATAAAAGCGGAAACATCATGCGAAGG-3'.

The method of analysis enables determination of the presence of a C or T at position
2678. Suitably, the restriction enzyme AciI can be used, since the presence of a C
creates a restriction site which is absent when a T is present. Similar examinations
within a pedigree will allow the determination of the genotype of offspring.

Thus, in an additional aspect, the present invention provides a method of determining the
coat colour genotype of a pig which comprises:

(i) obtaining a sample of pig nucleic acid; and

(ii) analysing the nucleic acid obtained in (i) to determine whether the KIT
gene carries any polymorphism associated with Belt genotype.

Preferably, the method comprises RFLP analysis, which is suitably carried out on a
sample of pig genomic DNA which has been amplified using PCR and a pair of
suitable primers.

Preferred methods for identifying the presence of the specific sequence change are
described above in relation to breed determinants.

Brief description of the Figures

Figure 1: Partial nucleotide sequence (a) and the derived amino acid sequence (b) of
the porcine aMSH-R gene as determined from a number of pig breeds. Position
numbers for the nucleotide sequence are based upon nucleotide 1 being the A of the
ATG initiation codon. Numbers of the amino acids are in accordance with the bovine
BDF3 sequence (Vanetti et al., FEBS Lett. 348:268-272 (1995)) to allow comparison.

Figure 2 : Agarose gel electrophoresis of DNA fragments obtained by digestion of 
DNA fragments amplified from the porcine αMSH-R gene with BstUI or HhaI. Lanes 
labelled M contain DNA markers of 50, 150, 300, 500, 750, 1000 bp. The other samples 
were derived from: 

1. Pietrain 

2. Pietrain 

3. Large White 

4. Large White 

5. Large White 

6. Duroc 

7. Duroc 

8. Hampshire 

9. Meishan 

10. Berkshire 

11. Berkshire 

Figure 3 : Agarose gel electrophoresis of DNA fragments obtained by digestion of 
DNA fragments amplified from the porcine αMSH-R gene with BstUI (lanes labelled 
B) or HhaI (lanes labelled H). Lanes labelled M contain DNA markers of 50, 150, 
300, 500, 750, 1000 bp. The other samples were derived from: 

1. Retailer 1. Skin 

2. Retailer 1. Fat 

3. Retailer 1. Muscle 

4. Retailer 2. Fat 

5. Retailer 2. Muscle 


Figure 4 : Electropherogram (4% agarose) showing RT-PCR products of KIT exon 16- 
19 with the primers KIT1F and KIT7R. The samples 1-3 and 4-6 are Swedish Large 
White and Hampshire pigs respectively. The size difference between the 424 and 301 
bp fragments is due to lack of exon 17 in the latter fraction. The two upper bands of the 
Yorkshire pigs were interpreted as heteroduplexes (HD). 

Figure 5 : A 48 bp sequence is shown comprising 21 bp of KIT exon 17 and 27 bp of 
KIT intron 17. The position of the intron/exon border is marked with a vertical line and 
the splice site mutation (nt1 G→A) indicated with a vertical arrow. Identical bases in 
alleles I and i are marked with a dot. 

Figure 6 : NlaIII PCR RFLP test used to detect the presence of a splice site mutation in 
intron 17 of the KIT gene. Figure 6A shows the position of two NlaIII recognition sites 
within the PCR product amplified using primer pair KIT21 and KIT35. All distances 
are given in base pairs. Figure 6B shows the size of fragments which result following 
NlaIII digestion of either normal KIT or splice mutant KIT. Figure 6C illustrates use of 
the PCR RFLP test. Lane 1 shows the KIT21/KIT35 amplified fragment undigested. 
Digestion was performed on PCR products amplified from, in Lane 2: a clone which 
contains the splice site mutation; Lane 3: a clone which contains the normal splice site 
sequence; Lane 4: genomic DNA from a coloured pig; Lane 5: genomic DNA from a 
white pig. Fragment sizes are given in base pairs. 

Figure 7 : Comparison of the ratio of normal to splice mutant KIT for three classes of 
genotype. 

Figure 8 : Comparison of the ratio of normal to splice mutant KIT for two breeds of pig. 

Figure 9 : SSCP analysis of the KIT gene in Swedish Landrace (lanes 1-8) and Wild 
Boar (lanes 9 & 10) breeds. The two polymorphic bands are indicated. 

Figure 10 : Nucleotide sequence of the porcine KIT cDNA from an animal of the 
Hampshire breed. The sequence is numbered with the first nucleotide of the N terminal 
methionine codon taken as 1. 

Figure 11 : Polyacrylamide gel electrophoresis of PCR-RFLP analysis of KIT gene at 
polymorphic nucleotide 2678 in a number of animals. Lanes: 1 & 2, Hampshire and Wild 
Boar respectively, both homozygous for the C at position 2678. Lanes 3-7 and 9 & 10, 
unrelated Large White sows all homozygous for T at position 2678. Lane 11, a Pietrain, 
homozygous for T at this position and lane 8 a Large White sow heterozygous for C 
and T. Lane 12 contains undigested PCR product and lane M DNA size standards. 

Figure 12 : Nucleotide sequence of the 3' end of the porcine αMSHR coding region and adjacent 
3' untranslated region. The TGA stop codon is highlighted in bold, the primer binding 
site for EPIG14 is shown in italics. Numbering is based on the system used in figure 
1a in which nucleotide 1 is the A of the ATG initiation codon of the Wild Boar 
sequence. Bases in common with the European Wild Boar are marked with a dash. 
Missing bases are marked with a colon (:). 

Examples 

Example 1 : Determination of the sequence of the αMSHR gene 

The DNA sequence of the porcine αMSHR gene was determined through the DNA 
sequencing of a combination of PCR products and cloned portions of porcine DNA. 

Preparation of template DNA for PCR 

DNA can be prepared from any source of tissue containing cell nuclei, for example 
white blood cells, hair follicles, ear notches and muscle. The procedure here relates to 
blood cell preparations; other tissues can be processed similarly by directly 
suspending material in K buffer and then proceeding from the same stage of the blood 
procedure. The method outlined here produces a cell lysate containing crude DNA 
which is suitable for PCR amplification. However, any method for preparing 
purified, or crude, DNA should be equally effective. 

Blood was collected in 50mM EDTA pH 8.0 to prevent coagulation. 50 µl of blood 
was dispensed into a small microcentrifuge tube (0.5ml Eppendorf or equivalent). 
450 µl of TE buffer was added to lyse the red blood cells (haem groups inhibit PCR) 
and the mix vortexed for 2 seconds. The intact white and residual red blood cells 
were then centrifuged for 12 seconds at 13,000 g in a microcentrifuge. The 
supernatant was removed by gentle aspiration using a low pressure vacuum pump 
system. A further 450 µl of TE buffer was then added to lyse the remaining red blood 
cells and the white blood cells collected by centrifugation as before. If any redness 
remained in the pellet, this process was repeated until the pellet was white. After 
removal of the last drop of supernatant from the pelleted white blood cells, 100 µl of K 
buffer containing proteinase K was added and the mixture incubated at 55 degrees C 
for 2 hours. The mixture was then heated to 95-100 degrees C for 8 minutes and the 
DNA lysates stored at -20 degrees C until needed. 

Reagents 

T.E. Buffer: 10 mM TRIS-HCl pH 8.0 
             1 mM EDTA 

K Buffer:    50 mM KCl 
             10 mM TRIS-HCl pH 8.3 
             2.5 mM MgCl2 
             0.5% Tween 20 

PCR to produce DNA sequencing template 

The αMSHR gene was amplified for sequence analysis using three primer pairs. 

Primers MSHR Forward Primer 1: (5'-TGT AAA AGG AGG GGG AGT RGT GGC 
TGG AGG TGT CGA T-3'); and 

MSHR Forward Primer 5: (5'-GGG GGA GAT GGC CGC GAT GGA CCG-3') 
amplify a 428 bp fragment from the 5' half of the gene. 

Primers MSHR Forward Primer 2: (5'-CGG CCA TCT GGG CGG GCA GCG 
TGC-3'); 

and αMSHR Reverse Primer 2: (5'-GGA AGG CGT AGA TGA GGG GGT CCA-3') 

amplify a 405 bp fragment from the 3' half of the gene. 

As these two fragments are non-overlapping a third primer pair 

αMSHR Forward Primer 4 (5'-TGC GCT ACC ACA GCA TCG TGA CCC TGC-3'); 

and 

αMSHR Reverse Primer 4 (5'-GTA GTA GGC GAT GAA GAG CGT GCT-3') 

were used to amplify a 98 bp fragment which spans the 50 bp gap. PCR was carried 
out on a DNA thermal cycler (Perkin Elmer 9600) in a total volume of 20 µl 
containing 25 ng genomic DNA, 1.0 mM MgCl2, 50 mM KCl, 10 mM Tris-HCl, pH 
8.3, 200 µM dNTPs, 0.5 U AmpliTaq Gold (Perkin Elmer) and 10 pmol of both 
forward and reverse primer. To activate AmpliTaq Gold, initial heat denaturation was 
carried out at 94 degrees C for 10 minutes followed by 32 cycles each consisting of 45 
sec at 94 degrees C, 45 sec at 53 degrees C and 45 sec at 72 degrees C. The final 
extension lasted for 7 min at 72 degrees C. PCR products were cloned into vector 
pUC18 using the SureClone ligation kit (Pharmacia). 
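
As a rough cross-check of the reaction set-up above, the following sketch converts the stated primer amount into a final concentration and adds up the programmed hold times of the cycling regime. The function names are illustrative only and are not part of the method, and ramp times between steps are ignored, so the run time on a real cycler will be somewhat longer.

```python
def primer_concentration_um(pmol, volume_ul):
    """Final primer concentration in micromolar (pmol per µl equals µM)."""
    return pmol / volume_ul


def programmed_hold_seconds(initial_s, cycles, step_seconds, final_s):
    """Sum of programmed hold times only; ramping between steps is ignored."""
    return initial_s + cycles * sum(step_seconds) + final_s


# Values taken from the protocol above: 10 pmol of each primer in 20 µl,
# 10 min activation, 32 cycles of 3 x 45 s, and a 7 min final extension.
conc = primer_concentration_um(10, 20)
total = programmed_hold_seconds(10 * 60, 32, (45, 45, 45), 7 * 60)
print(f"{conc} µM of each primer; {total / 60:.0f} min of programmed holds")
```

Under these assumptions each primer is present at 0.5 µM and the programme holds for 89 minutes in total.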

Preparation of plasmid DNA 

Plasmid DNA was purified from overnight bacterial culture using the Jetstar plasmid 
midi kit 50 (Genomed) and the resulting DNA diluted to 150 ng/µl. 

Sequencing of plasmid DNA 

Cloned plasmid inserts were sequenced using dye primer chemistry. Each cycling 
reaction was prepared with template and ready reaction mix containing fluorescently 
labelled M13 forward or reverse primer as described in the ABI Prism protocol P/N 
402113 (Perkin Elmer). Cycling and sample pooling was performed using a Catalyst 
800 Molecular Biology Workstation (ABI) following the instrument's user manual 
(Document number 903877, Perkin Elmer). The resulting extension products were 
purified, loaded and analysed using the 377 ABI Prism DNA sequencer as described 
by the instrument protocol (Perkin Elmer protocol P/N 402078). 

Dye Terminator Sequencing of PCR products 

Dye terminator DNA sequencing requires purification of PCR product free from 
excess dNTPs and residual primers. This was achieved by passage of the template 
DNA through QiaQuick spin columns (Qiagen) before the purified DNA was diluted 
to 15 ng/µl. Dye terminator cycle sequencing was performed using AmpliTaq DNA 
polymerase FS in accordance with the ABI Prism protocol P/N 402078 (Perkin 
Elmer). Cycle sequencing reactions were performed in a total reaction volume of 10 
µl. This comprised 1.6 pmole of either the forward or reverse primer used to amplify 
the target fragment from genomic DNA, 20 ng of purified template DNA and 
terminator ready reaction mix (Perkin Elmer) which contains each of four dye 
terminators, dNTPs, Tris-HCl (pH 9.0), MgCl2, thermally stable pyrophosphatase and 
AmpliTaq DNA polymerase FS. Cycle sequencing was performed with a GeneAmp 
9600 machine (Perkin Elmer) over 25 cycles, each consisting of 10 sec at 96 degrees 
C, 5 sec at 50 degrees C and 4 min at 60 degrees C. Extension products were purified 
for gel separation using ethanol precipitation, loaded and run on a 377 ABI Prism 
DNA sequencer as described by the instrument protocol (Perkin Elmer protocol P/N 
402078). 

Results 

The partial coding region DNA sequence of the porcine αMSHR gene from 
a number of pig breeds is given in figure 1a combined with sequence determined in 
example 22. The derived amino acid sequence is shown in figure 1b. 

Example 2: PCR-RFLP based discrimination of alleles at the E locus 



DNA preparation for PCR 



As in example 1. 

PCR 

Reactions were set up in a 20 µl reaction volume in thin walled 0.25ml tubes (Perkin 
Elmer) with the following components: 

20 µl reaction volume: 
2 µl template DNA 
1.5 mM MgCl2 
200 µM each dNTP, 
3pM each of forward and reverse primers 
0.5 U AmpliTaq Gold (Perkin Elmer) 

MSHR Forward primer 3 sequence: 5' GCA CAT CGC CCG GCT CCA CAA GAC 3' 

MSHR Reverse primer 3 sequence: 5' GGG GCA GAG GAC GAC GAG GGA GAG 3' 

The reaction tubes were placed on a Perkin Elmer 9600 thermal cycler preheated to 94 
degrees C and PCR carried out according to the regime below: - 
Initial denaturation step of 94 degrees C for 10 min. 
33 cycles: 94 degrees C - 45 secs 
53 degrees C - 45 secs 
72 degrees C - 45 secs 
The last cycle is followed by a final elongation of 72 degrees C for 7 min. Samples 
are stored at 4 degrees C until required. 

Restriction Enzyme Digestion and Electrophoresis 

The PCR amplification product is 148bp in length. To test for polymorphism in the 
amplified products the reaction is split into two aliquots of 10 µl each of which is 
digested with HhaI (GIBCO-BRL) or BstUI (New England Biolabs). The reactions 
are set up and incubated as below: 

BstUI digest                 HhaI digest 
10 µl amplified DNA          10 µl amplified DNA 
2.5 u BstUI                  2.5 u HhaI 
                             0.5 µl 10x React 2 buffer (GIBCO-BRL) 
60 degrees C 60 minutes      37 degrees C 60 minutes 



Following digestion, 2 µl of loading dye is added to each reaction (100mM Tris 
pH8.0, 100mM Boric Acid, 1mM EDTA, 50% (v/v) glycerol, 0.02% w/v Orange G) 
and the mixes loaded on a 4% agarose gel (3% NuSieve/1% Seakem, FMC 
Bioproducts) in 0.5x TBE (44.5mM Tris pH8.0, 44.5 mM boric acid and 0.5mM 
EDTA) and electrophoresed for 1 hour at 150v. 



Products are visualised by ethidium bromide staining. 



Results 

BstUI and HhaI digestion each result in bands of 61 and 87bp. The relationship of 
digestion to the possible allele is as shown in the table below: 



Relationship of restriction digest profiles to individual alleles at the E locus 

Allele       Digestion with BstUI   Digestion with HhaI 
E^/EF/e"     Yes                    Yes 
             No                     Yes 
e            No                     No 



If the uncut alleles are designated as allele 1 and the alleles digesting with each 
enzyme as allele 2 the various genotypes will be as shown in the table below: 



Actual E genotypes and associated scores 

Genotype                    BstUI   HhaI 
                            1/1     2/2 
E"/EF or E"/E^              1/2     2/2 
ET/e                        1/1     1/2 
EF/EF or EF/E^ or E'/e:'    2/2     2/2 
EF/e or E'/e                1/2     1/2 
e/e                         1/1     1/1 


Note: The results for animals carrying the allele ^ will be the same as those carrying 
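
The 1/2 scoring convention used in the table above can be sketched as a small helper that scores one digest lane from its band sizes. This is only an illustration: the function name and the 3 bp sizing tolerance are assumptions, and only the 148 bp product and the 61 and 87 bp digestion fragments are taken from the text.

```python
UNCUT_BP = 148        # undigested PCR product
CUT_BP = (61, 87)     # fragments produced when the site is cut


def score_lane(bands, tolerance_bp=3):
    """Score one digest lane as '1/1', '1/2' or '2/2' from its band sizes."""
    def seen(size):
        return any(abs(band - size) <= tolerance_bp for band in bands)

    has_uncut = seen(UNCUT_BP)
    has_cut = all(seen(size) for size in CUT_BP)
    if has_uncut and has_cut:
        return "1/2"   # one allele cut, one allele uncut
    if has_uncut:
        return "1/1"   # neither allele cut
    if has_cut:
        return "2/2"   # both alleles cut
    raise ValueError(f"unexpected band pattern: {sorted(bands)}")


# One animal is described by its BstUI score and its HhaI score.
bstui = score_lane([148])        # no digestion
hhai = score_lane([87, 61])      # complete digestion
print(bstui, hhai)
```

A lane showing all three bands (148, 87 and 61 bp) would be scored 1/2 by the same helper.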

Samples were prepared from a number of pigs and tested according to the above 
protocol. The results are shown in the table below and figure 2 illustrates the patterns 
seen upon electrophoresis. 

E genotypes determined for a range of breeds using the BstUI/HhaI digestion system 

Breed          No Tested   Genotype (see note 1)   αMSHR type 
                                                   BstUI   HhaI 
Hampshire      9           E^'/Ef'                 2/2     2/2 
Large White    4           E^/EF                   2/2     2/2 
Landrace       1           EF/EF                   2/2     2/2 
Pietrain       3           EF/EF                   2/2     2/2 
Berkshire      2           EF/EF                   2/2     2/2 
Bazna          4           EF/ET                   1/2     2/2 
                           EF'/EF'                 1/1     2/2 
Duroc          4           e/e                     1/1     1/1 
Meishan        3           EF'/EF'                 1/1     2/2 


Note 1, Thegenotype cannot be distinguished from or ^ in this particular test. 

As can be seen from the results above the genotypes determined fit with those 
expected from the sequencing data given in figure 1a for Hampshire, Large White, 
Meishan and Duroc. The additional breeds typed here show the genotypes expected 
from their phenotype and descriptions in published literature (Ollivier and Sellier, 
Ann. Genet. Sel. Anim., 14: 481-544, (1982)). The Pietrain is a white breed with 
black patches of varying extent and has long been considered to be EF (in agreement 
with the result here). The Berkshire, originally a spotted breed, is now a mainly black 
animal with white 'socks' again generally considered to be EF as was found here. The 
Landrace is a white animal due to it carrying the dominant white allele at the I locus, 
however its genotype at the E locus has been shown to be EF from classical breeding 
studies. Once again this is in agreement with the results obtained here. The Bazna is 
a Romanian breed having black base colour with a white belt. It was developed from 
the Berkshire and Mangalitza, a Hungarian breed with a number of colour variations 
including black (Porter, Pigs, a handbook to breeds of the world, publ: Helm 
Information, ISBN 1-873403-17-8 (1993)). The ancestry of the Bazna being based 
upon a black breed potentially carrying a similar allele to the Meishan, and the 
Berkshire carrying EF, is in agreement with the alleles found to be present in the breed 
in this work. 

Example 3: Validation of source breeds of retail meats 
DNA preparation 

DNA was prepared from different parts of pork chops from two separate retailers. 
The DNA was prepared from skin (1 retailer only), fat and muscle using the Promega 
Wizard Genomic DNA preparation kit according to the manufacturer's instructions. 
Approximately 4mm^ of each tissue was cut into small fragments for the extraction. 

PCR and Restriction Digest Analysis 

This was carried out exactly as in example 2. 

Results 

The results are shown in figure 3. It can be seen that DNA extracted from a range of 
tissue types can be utilised for this DNA based test with results being obtained here 
for muscle, fat and skin. The genotype of the pig with regard to the αMSHR gene can 
then be determined. In this case the material from both retailers was derived from an 
animal of test type BstUI 1/2 and HhaI 1/2 using the nomenclature as in example 2. 
This translates into genotype EF/e or E^/e. Based on our current knowledge of the 
distribution of the alleles in commercial pig breeds the conclusion can be drawn that 
both source animals contain genetic material derived from the Duroc. 

Example 4: Validation of source breeds of processed meat samples 

Method 

DNA was prepared from heat treated meat samples according to the method of Meyer 
et al (Journal of AOAC International, 78 1542-1551). Meat samples were minced 
with a scalpel and 0.3g transferred to a sterile 1.5ml eppendorf tube containing 430 µl 
of extraction buffer (10mM Tris-HCl pH 8.0, 150mM NaCl, 2mM EDTA, and 1% 
w/v sodium dodecyl sulphate). Fifty microlitres of 5M guanidine hydrochloride and 
20 µl of 20mg/ml proteinase K (Boehringer) were added and mixed by inversion 
followed by incubation at 57°C for 3h. After digestion samples were centrifuged for 
10 min at 13,000 x g, and 450 µl of the aqueous phase added to 1ml Wizard DNA 
purification resin (Promega). The mixture was mixed by gentle inversion and 
following the Wizard DNA clean-up procedure carried out according to the 
manufacturer's instructions the purified DNA was eluted with 50 µl of 70°C water. 1 µl 
of a 1:10 dilution was then used as template in a 10 µl PCR. 

PCR was carried out as described in the previous example. 

Results 

Meat samples from a Large White based line and a Duroc based line heated at 80°C 
for 30 mins could be differentiated on the basis of their genotype at the E locus with 
the Large White samples giving a pattern characteristic of the EF allele and the Duroc 
samples a pattern characteristic of the e allele. 

Example 5: Validation of source breeds of semen 

Genomic DNA was isolated from porcine semen. 1ml of semen was centrifuged for 2 
min at 13,500 x g and the supernatant removed. 1 ml of 2xSSC was added and the 
mix vortexed to resuspend the sperm. The mix was then centrifuged as before and the 
supernatant removed. 400 µl of 0.2M NaOAc pH 7.0 was added and the mix vortexed 
followed by the addition of 34 µl of β-mercaptoethanol. The mixture was incubated at 
40°C for 30 min followed by the addition of 100 µl of 10% w/v sodium dodecyl 
sulphate and 50 µl of 15 mg/ml Proteinase K (Boehringer) and further incubation at 
40°C for 3 hours. 500 µl phenol equilibrated with Tris-HCl pH 8.0 was added and the 
mix vortexed twice followed by centrifugation at 13,500 x g for 4 min. 400 µl of the 
aqueous phase was removed and 800 µl of ethanol added. DNA was allowed to 
precipitate for 5 min at room temperature followed by centrifugation at 13,500 x g for 
5 min. The pellet was washed with 800 µl 70% ethanol v/v and air dried followed by 
resuspension in 200 µl of Wizard DNA resuspension buffer (Promega). 1 µl of a 1/10 
dilution was used in a 10 µl PCR. 

PCR was carried out as described in example 2. 

Results 

Semen from a Hampshire based line and a Duroc based line could be differentiated on 
the basis of their genotype at the E locus with the Hampshire samples giving a pattern 
characteristic of the E^ allele and the Duroc samples a pattern characteristic of the e 
allele. 

Example 6: Discrimination of allele from alleles EF/Ef" 
DNA preparation 

DNA was prepared as described in example 1. 



PCR 

Reactions were set up in a 20 µl reaction volume in thin walled 0.25ml tubes (Perkin 
Elmer) with the following components: 

10 µl reaction volume: 
2 µl template DNA 
2.5 mM MgCl2 
200 µM each dNTP, 
5 pmol each of forward and reverse primers 
0.5 U AmpliTaq Gold (Perkin Elmer) 

Forward primer sequence: 5' CTG CCT GGC CGT GTC GGA CCT G 3' 
Reverse primer sequence: 5' CTG TGG TAG CGC AGC GCG TAG AAG 3' 

The reaction tubes were placed on a Stratagene Robocycler and PCR carried out 
according to the regime below: - 
Initial denaturation step of 94°C for 10 min. 
30 cycles: 94°C - 60 secs 
61°C - 60 secs 
72°C - 60 secs 
The last cycle is followed by a final elongation of 72°C for 7 min. Samples are held at 
6°C until required. 

Restriction Enzyme Digestion and Electrophoresis 

The PCR amplification product is 228 bp in length. To test for polymorphism in the 
amplified products the reaction is digested with BspHI (New England Biolabs). The 
reactions are set up and incubated as below: 

BspHI digest 

10 µl amplified DNA 
1 µl 10x React 2 (NEB New England Biolabs) 
0.5 µl deionised water 
5 units BstUI 

60 minutes 




Following digestion, 2 µl of loading dye is added to the reaction (100mM Tris pH8.0, 
100mM Boric Acid, 1mM EDTA, 50% (v/v) glycerol, 0.02% w/v Orange G) and the 
mix loaded on a 4% agarose gel (3% NuSieve/1% Seakem, FMC Bioproducts) in 0.5x 
TBE (44.5mM Tris pH8.0, 44.5 mM boric acid and 0.5mM EDTA) and 
electrophoresed for 1 hour at 150v. 

Products are visualised by ethidium bromide staining. 

Results 

BspHI digestion results in bands of 124 and 104bp. The relationship of digestion 
to the possible allele is as shown below: 

Relationship of restriction digest profiles to individual alleles at the E locus 

Allele       Digestion with BspHI 
             Yes 
             No 



Samples were prepared from a number of pigs and tested according to the above 
protocol and the results are shown below: 

E genotypes determined for a range of breeds using the BspHI digestion system 

Breed                           No Tested   Genotype (see note 1)   Number 
Wild Boar x Swedish Landrace    3                                   3 
Large White                     4                                   4 
Landrace                        1           E'/If                   1 
Pietrain                        3           ^/^                     3 

Note 1. Where the genotype is listed this cannot be distinguished from E^ in this 
particular test. 

Example 7: Discrimination of cattle products by breed 

DNA was prepared from cattle muscle samples as described in example 4. PCR was 
then carried out in a 100 µl reaction using the primer pair: 

5'-TGAGGTAGGAGAGTTTTGGG-3' and 
5'-TCGAAATTGAGGGGAAGACC-3' 

as described in Kambadur et al Genome Research 7: 910-915 (1997) at a 
concentration of 500nM with other reaction components being 2.5mM MgCl2, 200 µM 
dNTPs, 50mM KCl, 10mM Tris-HCl pH 8.3, 5 units AmpliTaq Gold (Perkin Elmer). 
1 µl of bovine genomic DNA was used as template. Denaturation was carried out for 
12 min at 94°C followed by 30 cycles of 94°C for 1 min, 55°C for 1 min, 72°C for 1.5 min 
followed by 5 min at 72°C. Following PCR 2.0 µl of loading dye (44.5mM Tris pH 
8.0, 44.5mM boric acid, 0.5mM EDTA, 50%w/v glycerol, 0.02% w/v Orange G) was 
added to 10 µl of product and analysis carried out by electrophoresis on a 2% agarose 
gel prepared in 0.5x TBE buffer (44.5mM Tris pH 8.0, 44.5mM boric acid, 0.5mM 
EDTA) for 1 hour at 100V. 

The remainder of the PCR was analysed by DNA sequencing using ABI dye 
terminator chemistry as described in example 1. 

Results 

Bovine myostatin DNA polymorphisms and related phenotype 

Breed               Phenotype       nt position 941   length PCR product (bp) 
Belgian Blue        Double muscle   G                 482 
Piedmontese         Double muscle   A                 493 
Holstein-Friesian   normal          G                 493 




Example 8 

RT-PCR of porcine KIT exon 16-19 

i. mRNA purification from blood samples 

Fresh blood samples were collected in citrate tubes from coloured Hampshire pigs and 
Large White pigs. Leukocytes were isolated from 5 ml blood using Ficoll 100 
(Pharmacia Biotech). Isolation of mRNA from leukocytes was then carried out using 
the Quickprep Micro mRNA purification kit (Pharmacia Biotech). The mRNA was 
stored as a precipitate under ethanol at -KfC for up to one month before use m 
reverse transcriptase (RT)-PCR. 

ii. RT-PCR of KIT exon 16-19 

First-strand cDNA synthesis was accomplished using the First-Strand cDNA 
Synthesis kit (Pharmacia Biotech) so that ~100 ng mRNA was randomly primed by 
0.1 µg pd(N6) in a total volume of 15 µl. Two µl of the completed first cDNA strand 
reaction was then directly used per 12 µl PCR reaction by adding 10 µl PCR mix 
containing 10 pmol each of the mouse/human derived primers KIT1F and KIT7R (5'- 
TCR TAG ATA GAA AGA GAY GTG AGT G and 5'-AGG GTT GGT TGA TGA 
TGT TGT AG, respectively; Moller et al. 1996, supra), 1.2 µl 10 x PCR-buffer (10 
mM Tris-HCl, pH 8.3, 50 mM KCl) and 0.5 U of AmpliTaq polymerase (Perkin- 
Elmer) incubated with an equal amount Taqstart antibody (Clontech) at 25°C for 5 
min to achieve a hot start PCR. The reaction was covered with 20 µl mineral oil and 
thermocycled in a Hybaid Touchdown machine (Hybaid) with 40 cycles at 94°C for 1 
min, 55-48°C (touchdown one degree per cycle the first seven cycles and then 48°C 
in the remaining cycles) for 1 min and 72°C for 1 min. After PCR 2 µl loading dye 
was added to each sample which were then loaded on 4% agarose gel 
(Nusieve/Seakem 3:1, FMC Bioproducts) and electrophoresed with 100V for 80 min. 
Products were visualised by ethidium bromide staining and UV-illumination. 

iii. Cloning and sequencing of RT-PCR products 

The RT-PCR products representing KIT exon 16-19 were purified by extraction from 
2% agarose gels using the QIAEX gel extraction kit (QIAGEN) and cloned into the 
pUC18 vector using the Sureclone ligation kit (Pharmacia Biotech). Plasmids were 
isolated using the QIAFilter plasmid Midi kit (QIAGEN). Cloned plasmid inserts 
were sequenced using dye primer chemistry. Each cycling reaction was prepared with 
plasmid template DNA and ready reaction mix containing fluorescently labelled M13 
forward or reverse primer as described in the ABI Prism protocol P/N 402113 (Perkin 
Elmer). Cycling and sample pooling were performed using a Catalyst 800 Molecular 
Biology Workstation (ABI) following the instrument's user manual (Document 
number 903877, Perkin Elmer). The resulting extension products were purified, 
loaded and analysed using the 377 ABI Prism sequencer as described by the 
instrument protocol P/N 402078 (Perkin Elmer). 

iv. Results and discussion 

A 424 bp fragment including KIT cDNA exon 16-19 was amplified from all pigs. The 
Hampshire pigs did not show any additional products whereas the Large White pigs 
(eight tested) all showed a 301 bp truncated cDNA fragment (Fig 4). Sequence 
analysis revealed the 424 bp fragment was identical in the two breeds whereas the 
whole exon 17 (123 bp) was missing from the 301 bp fragment. Apparent differences 
between individuals regarding the relative amounts of these two products may have 
been caused either by different genotypes containing differing numbers of copies of 
the KIT gene sequence, individual differences in mRNA expression levels or random 
RT-PCR effects. 

The two upper fragments present in Large White pigs represent heteroduplexes 
between the 301 and 424 bp fragments (Fig. 4). This was shown by an experiment 
where these slow migrating fragments were generated by pooling homoduplexes of 
the 424 and 301 bp which were then heat denatured and cooled to 25°C. Moreover, 
cloning of the lower heteroduplex fraction of a Large White pig resulted in clones 
with insert length corresponding to either of the two homoduplexes. 

Example 9 



PCR Amplification and Sequencing of KIT Exon 17-Intron 17 (5' Splice Site) 

i. PCR to produce DNA Sequencing Template 

A 175 bp region including the boundary between exon 17 and intron 17 of the KIT 
gene was amplified for sequence analysis using forward primer KIT21 (5' - GTA 
TTC ACA GAG ACT TGG CGG C -3') and reverse primer KIT35 (5' - AAA CCT 
GCA AGG AAA ATC CTT CAC GG - 3'). PCR was carried out on a DNA thermal 
cycler (Perkin Elmer 9600) in a total volume of 20 µl containing 25 ng genomic 
DNA, 1.0 mM MgCl2, 50 mM KCl, 10 mM Tris-HCl, pH 8.3, 200 µM dNTPs, 0.5 U 
AmpliTaq Gold (Perkin Elmer) and 10 pmol of both KIT21 and KIT35 primer. To 
activate AmpliTaq Gold, initial heat denaturation was carried out at 94°C for 10 
minutes followed by 32 cycles each consisting of 45 sec at 94°C, 45 sec at 55°C and 
45 sec at 72°C. The final extension lasted for 7 min at 72°C. PCR products were 
cloned into vector pUC18 using the SureClone ligation kit (Pharmacia Biotech). 

ii. Preparation of Plasmid DNA 

Plasmid DNA was purified from overnight bacterial culture using the Jetstar plasmid 
midi kit (Genomed) and the resulting DNA diluted to 150 ng/µl. 

iii. Sequencing of plasmid DNA 
DNA was sequenced as in example 8. 

iv. Results 

A portion of the DNA sequence from exon 17 and intron 17 of the KIT gene was 
determined and compared between animals with each of these three alleles. Figure 5 
shows that the I allele carries a splice site mutation at position 1 of intron 17. This G 
to A base substitution is present in one of the two gene copies carried on each 
chromosome. The base substitution occurs in the invariant GT dinucleotide which 
characterises 5' exon/intron boundaries. Analysis of the I^P allele showed the splice 
site mutation was not present in either the normal (KIT1) or duplicated copy of the 
gene (KIT2). We have found the splice site mutation is unique to the I alleles, and 
therefore makes it possible to distinguish the I-KIT2 sequences. 



Example 10 

Testing For the Presence of the Splice Site Mutation with PCR RFLP 

To easily test for the presence of the G to A splice site mutation, restriction 
endonuclease NlaIII (CATG) was used to exploit the point substitution identified at 
position 1 of intron 17 (Figure 5). The NlaIII recognition sites in the fragment 
amplified from KIT and the expected restriction products are illustrated in Figure 6A 
and 6B respectively. 

i. PCR to produce DNA for RFLP Test 

The PCR to produce DNA for RFLP analysis was performed exactly as described in 
example 9. 

ii. Restriction Enzyme Digestion and Electrophoresis 

The PCR amplification product is 175 bp in length. To test for polymorphism at 
position 1 of intron 17, digestion reactions were set up as below: 

3.0 µl PCR amplified DNA 
1.0 µl 10X NEBuffer 4 
0.1 µl BSA 100 µg/ml 
0.1 µl NlaIII 10 U/µl 
5.8 µl dH2O 

(1 X NEBuffer 4 (New England Biolabs) contains 50 mM potassium acetate, 20 mM 
Tris acetate, 10 mM magnesium acetate and 1 mM DTT). Following incubation at 
37°C for 90 minutes each 10 µl reaction volume had 2 µl of loading dye added and 
the mix loaded on a 8% native polyacrylamide gel (Protogel, 37.5:1 
acrylamide:bisacrylamide, National Diagnostics, Atlanta) in 0.5 X TBE (44.5 mM 
Tris pH 8.0, 44.5 mM boric acid and 0.5 mM EDTA) and electrophoresed for 3 hours 
at 200V in a vertical slab unit (SE600 Hoefer Scientific Instruments). Products were 
visualised by ethidium bromide staining. 

iii. Results 

A PCR RFLP protocol was designed to test for the presence of the splice site mutation 
as the substitution occurs within the recognition site for restriction endonuclease 
NlaIII. Figure 6B illustrates that presence of the G to A base substitution at position 1 
of KIT intron 17 results in restriction at each of two NlaIII recognition sites within the 
175 bp DNA fragment. Following electrophoresis, this results in fragments of sizes 80 
bp, 54 bp and 41 bp. Where the splice site mutation is absent however, incubation 
with NlaIII results in digestion only at recognition site 1. Following electrophoresis 
this results in fragments of 134 bp and 41 bp. The invariant NlaIII recognition site 1 
serves as an internal control to ensure complete digestion has taken place. Results of 
this PCR RFLP analysis are illustrated in Figure 6C. Analysis was performed on 
fragments amplified from clones which either carry the splice site mutation (lane 2) or 
carry the normal splice site sequence (lane 3). Lane 4 shows the result of analysis 
where DNA amplified from the genomic DNA of a coloured animal was used. Lane 5 
shows the resulting bands where a white animal was tested. The test was used to 
analyse 121 individuals from seven different breeds of pig. The splice site mutation 
was found only in the 97 animals with the dominant white phenotype (//- or /*//) and 
none of the 24 coloured (/ or /) examples (see table below). This analysis confirms / 
and /* to be unique in that they are the only alleles to carry the sphce site mutation. 
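
The banding logic of the NlaIII test can be written out as a short sketch. The function name and the boolean argument are illustrative assumptions; only the fragment sizes (175 bp amplicon, 41 bp control fragment, 134 bp or 80 bp + 54 bp remainder) come from the description above.

```python
def nlaiii_fragments(has_splice_mutation: bool) -> list[int]:
    """Expected NlaIII fragment sizes (bp) from the 175 bp amplicon.

    Hypothetical helper: it only encodes the sizes stated in the text.
    """
    # Recognition site 1 is invariant, so the 41 bp fragment is always
    # released and acts as the internal digestion control.
    if has_splice_mutation:
        # The G to A substitution creates the second site, which splits
        # the remaining 134 bp into 80 bp and 54 bp.
        fragments = [80, 54, 41]
    else:
        fragments = [134, 41]
    # Whatever the genotype, the fragments must add up to the amplicon.
    assert sum(fragments) == 175
    return fragments


if __name__ == "__main__":
    print("mutant copy:", nlaiii_fragments(True))
    print("normal copy:", nlaiii_fragments(False))
```

A sample carrying both normal and mutant copies would be expected to show the union of the two band sets.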

Distribution of the Splice Site Mutation Between Different Breeds and Coat
Phenotype

Breed                    Coat      Assumed    Animals  Normally      Splice
                         Colour    Genotype*  Tested   spliced KIT†  Mutation†

Large White              white     I/-        33       33            33
Landrace                 white     I/-        56       56            56
Hampshire                coloured  i/i         5        5             0
Duroc                    coloured  i/i         5        5             0
Pietrain                 coloured  i/i         8        8             0
Meishan                  coloured  i/i         5        5             0
Wild Boar                coloured  i/i         1        1             0
Wild Boar x Large White  white     I*/i        8        8             8

Totals                   white     I/-        89       89            89
                         white     I*/i        8        8             8
                         coloured  i/i        24       24             0

* White animals may be homozygous or heterozygous for the I allele
† Presence of the splice site mutation determined by NlaIII PCR RFLP test

Example 11

Quantification of Normal KIT and Splice Mutant KIT (Intron 17 nt1)

As the splice site mutation is present in only one of the duplicated regions of I
and not in the duplicated region of I^P, the various genotypes can be expected to
have the attributes described in the table below:



Genotype  Copies of    Copies of KIT          Ratio of normal KIT to
          Normal KIT   containing the splice  splice mutant KIT
                       mutation

I/I       2            2                      1:1
I/i       2            1                      2:1
i/i       2            0                      2:0
I/I^P     3            1                      3:1
I^P/i     3            0                      3:0
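
The copy numbers in the table above follow from adding up the contribution of each chromosome. The sketch below assumes the per-allele copy numbers implied by the table (i: one normal copy; I: one normal and one splice mutant copy; I^P: two normal copies); the dictionary and function names are illustrative and not part of the described method.

```python
# (normal KIT copies, splice mutant KIT copies) carried on ONE chromosome.
# Assumed values, chosen so that the sums reproduce the table above.
COPIES_PER_ALLELE = {
    "i": (1, 0),   # single, normal KIT copy
    "I": (1, 1),   # duplication: one normal and one splice mutant copy
    "IP": (2, 0),  # duplication without the splice mutation (allele I^P)
}


def kit_copies(allele_a: str, allele_b: str) -> tuple[int, int]:
    """Return (normal, mutant) KIT copy numbers for a diploid genotype."""
    normal_a, mutant_a = COPIES_PER_ALLELE[allele_a]
    normal_b, mutant_b = COPIES_PER_ALLELE[allele_b]
    return normal_a + normal_b, mutant_a + mutant_b


if __name__ == "__main__":
    for genotype in [("I", "I"), ("I", "i"), ("i", "i"), ("I", "IP"), ("IP", "i")]:
        normal, mutant = kit_copies(*genotype)
        print("/".join(genotype), f"{normal}:{mutant}")
```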



Due to the dominance of allele I, three of the genotypes in the table are carried
by white animals and therefore cannot be identified by phenotypic
characterisation. Quantification of the relative amounts of the normal KIT gene
and the splice mutant KIT gene allows the ratio between the two to be calculated,
and therefore the genotype of individual animals to be predicted. This was
achieved by quantification of two DNA fragments following NlaIII digestion. The
amounts of the 134 bp fragment, representative of the normally spliced KIT gene,
and of the 54 bp fragment, representative of the splice mutant KIT, were measured
following electrophoresis using GeneScan software.

i. PCR to Produce DNA for Quantification

As described in example 9 section i. The reverse primer KIT35 is labelled with
the ABI fluorescent dye FAM at the 5' end.

ii. Restriction Enzyme Digestion

As described in example 9 section ii.

iii. Electrophoresis and Quantification of DNA Fragments

Following digestion, 0.5 µl of the reaction volume was mixed with 2.5 µl of
deionised formamide, 0.5 µl of GS350 DNA standard (ABI) and 0.4 µl blue dextran
solution before being heated to 90°C for 2 minutes and rapidly cooled on ice.
Three µl of this mix was then loaded onto a 377 ABI Prism sequencer and the DNA
fragments separated on a 6% polyacrylamide gel in 1 X TBE buffer for 2 hours at
700 V, 40 mA, 32 W. The peak areas of the fragments representative of both the
normal and splice mutant forms of KIT were quantitated using the GeneScan (ABI)
software.

iv. Ratio Calculations

The peak area value of the 134 bp fragment (normal KIT) was divided by twice the
peak area value of the 54 bp fragment (splice mutant KIT) in order to calculate
the ratio value for each sample.
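
As a minimal sketch of this calculation, the ratio can be computed directly from the two peak areas. The function names are illustrative; the nearest-value assignment to the expected ratios 1, 2 and 3 is an assumption added for illustration and is not a scoring rule stated in the text.

```python
def kit_ratio(area_134bp: float, area_54bp: float) -> float:
    """Normal:mutant ratio = area(134 bp) / (2 * area(54 bp))."""
    if area_54bp <= 0:
        raise ValueError("54 bp peak area must be positive")
    return area_134bp / (2.0 * area_54bp)


def nearest_expected_ratio(ratio: float, expected=(1, 2, 3)) -> int:
    """Hypothetical helper: pick the expected ratio closest to the observed one."""
    return min(expected, key=lambda value: abs(ratio - value))


if __name__ == "__main__":
    # Peak areas below are made-up numbers, used only to exercise the functions.
    observed = kit_ratio(area_134bp=4460.0, area_54bp=1000.0)
    print(round(observed, 2), nearest_expected_ratio(observed))
```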

v. Results

Analysis was performed on animals from the Swedish wild pig/Large White
intercross pedigree for which genotypes at I have been determined by conventional
breeding experiments with linked markers. Figure 7 and the table below show the
ratio of normal to mutant KIT calculated for animals from each of the three
genotype classes, I/I (expected ratio 1:1), I/i (expected ratio 2:1) and I/I^P
(expected ratio 3:1). The results are entirely consistent with the expected ratio
values and indicate that the three genotype classes can be distinguished using
this method.

Ratio of the Two KIT Forms in Different Dominant White Genotypes in a Wild
Pig/Large White Intercross

Genotype  Phenotype  Expected Ratio   Observed Ratio         Number
                     (Normal:Mutant)  (Normal:Mutant) ± SE   Tested

I/I       white      1:1              1.15 ± 0.075           13
I/I^P     white      3:1              3.11 ± 0.084           12
I/i       white      2:1              2.23 ± 0.109           14



Figure 7 illustrates that the ranges of ratio values calculated for the two
genotypes I/I and I/I^P do not overlap. This enables animals carrying the I^P
allele to be identified and the frequency of the allele within different pig
breeds to be determined. Ratio values were calculated for 56 Landrace and 33
Large White animals and the results are shown in Figure 8. A clearly bimodal
distribution is observed, with 7 Landrace and 3 Large White individuals having a
ratio value of approximately 3 or above, suggesting them to be heterozygous
carriers for the I^P allele (genotype I/I^P). This means I^P has gene frequency
estimates of 6.25% (7/112 chromosomes tested) and 4.5% (3/66 chromosomes tested)
within the Landrace and Large White breeds respectively.
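
The frequency estimates quoted above are simple proportions of chromosomes: each heterozygous carrier contributes one copy of the allele, and each animal contributes two chromosomes. A short sketch (the function name is illustrative):

```python
def allele_frequency(heterozygous_carriers: int, animals_tested: int) -> float:
    """Allele frequency when every carrier is heterozygous.

    Each animal has two chromosomes, so the denominator is 2 * animals.
    """
    chromosomes = 2 * animals_tested
    return heterozygous_carriers / chromosomes


if __name__ == "__main__":
    landrace = allele_frequency(7, 56)      # 7/112
    large_white = allele_frequency(3, 33)   # 3/66
    print(f"Landrace: {landrace:.2%}, Large White: {large_white:.2%}")
```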



EXAMPLE 12 



(i) DNA Preparation 

DNA can be prepared from any source of tissue containing cell nuclei, for example
white blood cells, hair follicles, ear notches and muscle. The procedure outlined
here relates to blood cell preparations; other tissues can be processed similarly
by directly suspending material in K buffer and then proceeding from the same
stage of the blood procedure. The method outlined here produces a cell lysate
containing crude DNA which is suitable for PCR amplification. However, any method
for preparing purified, or crude, DNA should be equally effective.

Blood was collected in 50 mM EDTA pH 8.0 to prevent coagulation. 50 µl of blood
was dispensed into a small microcentrifuge tube (0.5 ml Eppendorf or equivalent).
450 µl of TE buffer was added to lyse the red blood cells (haem groups inhibit
PCR) and the mix vortexed for 2 seconds. The intact white and residual red blood
cells were then centrifuged for 12 seconds at 13,000 g in a microcentrifuge. The
supernatant was removed by gentle aspiration using a low pressure vacuum pump
system. A further 450 µl of TE buffer was then added to lyse the remaining red
blood cells and the white blood cells were collected by centrifugation as before.
If any redness remained in the pellet, this process was repeated until the pellet
was white. After removal of the last drop of supernatant from the pelleted white
blood cells, 100 µl of K buffer containing proteinase K was added and the mixture
incubated at 55°C for 2 hours. The mixture was then heated to 95-100°C for 8
minutes and the DNA lysates stored at -20°C until needed.

Reagents 

TE buffer:  10 mM Tris-HCl pH 8.0
            1 mM EDTA

K buffer:   50 mM KCl
            10 mM Tris-HCl pH 8.3
            2.5 mM MgCl2
            0.5% Tween 20

Prior to use for lysates, 10 µl of 20 mg/ml proteinase K (Molecular Probes Inc.)
per 1.0 ml of K buffer was added.

(ii) PCR

Reactions were set up as follows in thin walled 0.25 ml tubes (Perkin Elmer):

4.0 µl 5 µM CRC Forward primer;
4.0 µl 5 µM CRC Reverse primer;
4.0 µl 5 µM KIT1-REV primer;
4.0 µl 5 µM KIT1-FOR primer;
4.0 µl 2 mM dNTPs (Pharmacia);
4.0 µl 35 mM MgCl2.

A wax bead (PCR Gem 50, Perkin Elmer) was added and the tube placed in a Perkin
Elmer 9600 thermal cycler. The tube was then raised to 80°C for 15 seconds
followed by cooling to 4°C. A second set of reagents was then added to each tube
as below:-

4.0 µl 10x buffer;
9.6 µl sterile deionised water;
0.4 µl (0.5 units) AmpliTaq DNA polymerase (Perkin Elmer);
2 µl DNA lysate.

Reaction tubes were then placed on a Perkin Elmer 9600 thermal cycler preheated
to 94°C and PCR carried out according to the regime indicated below:-

94°C for 4 minutes;
20 cycles of 94°C for 30 secs, 62°C for 30 secs and 72°C for 30 secs;
0°C until required.

The number of cycles may vary depending upon the tissue used as the DNA source. 
KIT primers

Forward GAATATTGTTGGTATGGTGATCTCC KIT1-FOR
Reverse GGGGTTGTGGGTGATGTTCGTG KIT1-REV

CRC primers

Forward GTGGATGTGGTGTGTTCGCTGT CRC-FORWARD
Reverse AGGTTTGTCTGCAGCAGAAGCTC CRC-REVERSE

The reverse KIT primer and the forward CRC primer are labelled with the ABI
fluorescent dye FAM at the 5' end.

(iii) Electrophoresis and Quantitation of DNA Fragments

1 µl of the PCR was mixed with 2.5 µl of deionised formamide, 0.5 µl of GS350
DNA standards and 0.4 µl blue dextran solution, and heated at 90°C for 2 minutes
followed by rapid cooling on ice. 3 µl of this mix were then loaded onto an ABI
373 DNA sequencer and DNA fragments separated on a 6% polyacrylamide gel in 1 x
TBE buffer for 2 hours at 700 V, 40 mA, 32 W. The fragments corresponding to the
products from the KIT and CRC genes were quantitated using GeneScan software, the
peak area for each of the bands being determined.

(iv) Results

The data given in the table below represent the results obtained from an
experiment in which DNA lysates were produced from each of 23 animals, with two
PCR tests being carried out on each lysate. The ratio of KIT peak area to CRC
peak area was calculated for each PCR and the average taken of those samples from
the same animal.



Animal  Genotype     KIT/CRC peak
                     area ratio

 1      II           3.25
 2      Ii           2.45
 3      II           2.94
 4      ii           1.16
 5      ii           1.34
 6      ii           1.20
 7      Ii           2.18
 8      Ii           2.19
 9      II           2.88
10      ii           1.30
11      Ii           1.84
12      II           2.84
13      ii           1.50
14      [illegible]  [illegible]
15      [illegible]  [illegible]
16      [illegible]  [illegible]
17      [illegible]  1.14
18      [illegible]  [illegible]
19      Ii           1.87
20      Ii           2.00
21      ii           0.99
22      ii           1.15
23      II           2.80



The upper and lower limits for the ratio values from animals of the different
genotypes II, Ii and ii in this experiment are as below:

Genotype  Upper Limit  Lower Limit

I/I       3.25         2.80
I/i       2.45         1.84
i/i       1.50         0.99

These results illustrate differentiation of the genotypes using this test.
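
The limits quoted above can be checked against the per-animal ratios. The sketch below uses only those table entries that are legible in the source text (18 of the 23 animals, with animal 23 taken as genotype II); the variable and function names are illustrative.

```python
# KIT/CRC peak area ratios grouped by genotype (legible entries only).
RATIOS = {
    "II": [3.25, 2.94, 2.88, 2.84, 2.80],
    "Ii": [2.45, 2.18, 2.19, 1.84, 1.87, 2.00],
    "ii": [1.16, 1.34, 1.20, 1.30, 1.50, 0.99, 1.15],
}


def limits(values):
    """Return (upper limit, lower limit) for one genotype class."""
    return max(values), min(values)


def ranges_are_disjoint(groups):
    """True if no genotype's ratio range overlaps another's."""
    spans = sorted((min(v), max(v)) for v in groups.values())
    return all(
        previous_high < next_low
        for (_, previous_high), (next_low, _) in zip(spans, spans[1:])
    )


if __name__ == "__main__":
    for genotype, values in RATIOS.items():
        print(genotype, limits(values))
    print("ranges disjoint:", ranges_are_disjoint(RATIOS))
```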
EXAMPLE 13 

The second test utilises unique sequences of DNA that are present at one end of
the duplication (or both ends if the duplicated region is reversed relative to
the rest of the gene or if the duplicated region does not occur in direct tandem
with the non-duplicated region). Oligonucleotide primers for use in PCR are
designed such that, at the annealing temperatures used in the PCR process, they
will anneal only to the junction regions at the end of the duplicated region. A
PCR is then carried out using two pairs of oligonucleotides. One pair consists of
the aforementioned primer spanning the junction region and a second primer a
suitable distance away, which allows amplification to occur only from an I allele
containing the duplication. The second pair of primers allows amplification of a
sequence present only as a single copy in the haploid genome. The product of this
reaction, carried out in the same tube, functions as an internal standard as in
the previous test. The ratio of product from the reaction specific to the
junction region is measured relative to that from the single copy control
sequence.

In this test there is a larger difference between the predicted ratios of the
products from the different genotypes. The relative levels of product and their
ratios are illustrated below:-

          Junction  Control
Genotype  Product   Product  Ratio

II        2         2        1:1
Ii        1         2        1:2
ii        0         2        0:2

These larger ratios allow greater differentiation between the ranges of results
obtained from the different genotypes, reducing the risk of mis-scoring animals.
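
The predicted product levels follow from counting alleles: each I allele contributes one junction template, while the single copy control contributes one template per chromosome regardless of genotype. A minimal sketch, assuming genotypes are written as two-letter strings such as "Ii" (a notation chosen here for illustration):

```python
def junction_and_control(genotype: str) -> tuple[int, int]:
    """Return (junction product, control product) in relative template copies."""
    if len(genotype) != 2 or set(genotype) - {"I", "i"}:
        raise ValueError("genotype must be one of 'II', 'Ii', 'iI' or 'ii'")
    junction = genotype.count("I")  # one junction region per I allele
    control = 2                     # single copy sequence, two chromosomes
    return junction, control


if __name__ == "__main__":
    for genotype in ("II", "Ii", "ii"):
        junction, control = junction_and_control(genotype)
        print(genotype, f"{junction}:{control}")
```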

EXAMPLE 14 

(i) DNA Preparation 

DNA can be prepared as described in example 12. 

(ii) PCR

Reactions were set up as follows in thin walled 0.25 ml tubes (Perkin Elmer):

2.0 µl 5 mM KITDEL2-FOR primer;
2.0 µl 5 mM KITDEL2-REV primer;
1.0 µl 2 mM dNTPs (Pharmacia);
1.2 µl 25 mM MgCl2;
2.0 µl 10x buffer (without MgCl2);
0.1 µl (0.5 units) AmpliTaq DNA polymerase (Perkin Elmer);
2.0 µl DNA lysate;
9.7 µl sterile deionised water.

Reaction tubes were then placed on a Perkin Elmer 9600 thermal cycler and PCR
carried out according to the regime indicated below:-

95°C for 1 minute;
3 cycles of 95°C for 15 secs, 50°C for 20 secs and 72°C for 40 secs;
27 cycles of 94°C for 15 secs, 50°C for 20 secs and 72°C for 50 secs;
72°C for 5 minutes;
4°C until required.

The number of cycles may vary depending upon the tissue used as the DNA
source.

KIT primers

Forward GAAAGTGA(C/T)GTCTGGTCCTAT(C/G)GGAT KITDEL2-FOR
Reverse AGCCTTCCTTGATCATCTTGTAG KITDEL2-REV

(iii) Electrophoresis

1 µl of the PCR product was mixed with 3 µl loading buffer (95% deionised
formamide, 10 mM NaOH, 20 mM EDTA, 0.05% bromophenol blue, 0.05%
xylene cyanol), heated to 95°C for 3 minutes followed by rapid cooling on ice.
The sample was then loaded on an 8% native polyacrylamide gel (Protogel, 37.5:1
acrylamide:bisacrylamide, National Diagnostics, Atlanta) in 1 x TBE buffer
(89 mM Tris, 89 mM boric acid, 2 mM EDTA.Na2). The DNA fragments were
separated by electrophoresis for 4.5 hours at 6 W with a constant temperature of
20°C and 0.6 x TBE as running buffer in a vertical slab unit (SE600 Hoefer
Scientific Instruments, San Francisco).

(iv) Visualisation of DNA fragments by silver staining

After electrophoresis the gel was incubated, with gentle agitation, in the fix
solution for 20 minutes or until the tracking dyes were no longer visible. The gel
was rinsed three times (2 minutes each with agitation) in deionised water. The gel
was then incubated in the staining solution for 40 minutes, with gentle agitation,
followed by a brief wash (5-10 seconds) in deionised water and direct transfer to
the developing solution. The gel was incubated in the developing solution until
bands were clearly visible and then the development was terminated by adding an
equal volume of fix solution. Finally, the gel was rinsed for 2 minutes in
deionised water.

Reagents 

Fix solution:         10% glacial acetic acid in deionised water

Staining solution:    2 g silver nitrate (AgNO3)
                      3 ml 37% formaldehyde
                      2 litres deionised water

Developing solution:  60 g sodium carbonate (Na2CO3) dissolved in 2 litres
                      deionised water. Immediately before use add 3 ml 37%
                      formaldehyde and 400 ml sodium thiosulfate (10 mg/ml).
                      The solution should be at a temperature of 10-12°C when
                      used.

(v) Results

This SSCP analysis reveals an informative polymorphism so far only found in
animals with the dominant white phenotype (Fig. 9). In lanes 1 to 8 the analysis
was carried out on DNA from Swedish Landrace pigs carrying the dominant white
colour and in lanes 9 and 10 DNA was from wild pigs of wild type colour. The
polymorphic bands are indicated. The polymorphism is characterised by two
unique fragments only present in animals carrying a duplicated KIT gene of allele
type I. The fragments represent heteroduplexes of DNA strands from PCR products
of unequal length representing the duplicated and non-duplicated copy of the KIT
gene. The results of a screening test with this marker using 40 unrelated animals
representing five breeds and 190 F2 animals from a Large White/Wild pig
intercross are presented in the table below:



BREED                  COLOUR    NO. OF    HETERODUPLEX
                                 ANIMALS   PRESENT   NOT PRESENT

SWEDISH LANDRACE       WHITE      10        10         0
SWEDISH LARGE WHITE    WHITE       8         8         0
SWEDISH HAMPSHIRE      COLOURED   10         0        10
SWEDISH DUROC          COLOURED   10         0        10
WILD PIG               COLOURED    2         0         2
LARGE WHITE/WILD PIG   WHITE     131       106        25
INTERCROSS             PATCH       9         0         9
                       COLOURED   50         0        50



The results show that this particular polymorphism is very closely associated with
the presence of the KIT duplication. It is not completely associated with the
duplication as some white animals did not show the heteroduplex pattern. The
polymorphism is therefore an example of a closely linked genetic marker which by
itself or in combination with other linked markers can be used to differentiate
genotypes as regards the dominant white coat colour.

EXAMPLE 15 



i) DNA extraction

DNA was prepared as in example 12.

ii) PCR

Reactions were set up in 0.25 ml thin walled reaction tubes (Perkin Elmer) as
follows:

0.5 µl 5 µM KITDEL1-FOR primer
0.5 µl 5 µM KITDEL1-REV primer
1.0 µl 2 mM dNTPs (Pharmacia)
1.0 µl 15 mM MgCl2
1.0 µl 10X buffer
4.9 µl sterile distilled water
0.1 µl AmpliTaq DNA polymerase
1.0 µl DNA lysate

Reaction tubes were then placed in a Perkin Elmer 9600 thermal cycler and PCR
carried out according to the regime:

94°C for 4 minutes;
21 cycles of 94°C for 30 sec, 60°C for 30 sec, and 72°C for 30 sec;
72°C for 4 min;
4°C until required.

The number of cycles used may vary depending on the tissue used as the source of
the DNA.

Primers

forward TGTGGGAGCTCTTCTCTTTAGG KITDEL1-FOR
reverse CCAGCAGGACAATGGGAACATCT KITDEL1-REV

The reverse primer was labelled with the ABI fluorescent dye FAM at the 5' end.

iii) Electrophoresis and quantitation of DNA fragments

An aliquot of the PCR was mixed with 1.5 µl of deionised formamide, 0.25 µl of
GS350 DNA standards and 0.25 µl loading buffer (50 mg/ml blue dextran, 25 mM
EDTA) and heated at 90°C for two minutes followed by rapid cooling on ice.
1.75 µl of this was then loaded onto an ABI 377 DNA sequencer and DNA fragments
separated on a 4.12% polyacrylamide gel in 1x TBE buffer for two hours at 3000 V,
60 mA, 200 W and 48°C. The 97 bp and 93 bp fragments, corresponding to the
products from the KIT gene template lacking the deletion and containing the
deletion respectively, were quantitated using GeneScan software, the peak area
for each of the bands being determined.

Results 

The data given in the table below represent the results obtained from an
experiment in which DNA lysates were produced from each of 20 animals of
known genotype with one PCR test being carried out on each lysate. The ratio of
the peak area of the product from the DNA template not containing the four base
pair deletion to that containing the deletion was calculated.



ANIMAL  GENOTYPE  Non del/del
                  peak area ratio

 1      II        1.347
 2      II        1.21
 3      II        1.33
 4      II        2.267
 5      II        0.444
 6      II        0.713
 7      II        8.387
 8      II        0.994
 9      II        1.673
10      II        1.056
11      Ii        1.751
12      Ii        1.73
13      Ii        1.83
14      Ii        0.631
15      Ii        1.975
16      Ii        2.147
17      Ii        1.901
18      Ii        1.749
19      Ii        2.103
20      Ii        2.026



For this small sample the value of 1.5, which is midway between the predicted
ratio values for each genotype (expected ratio = 2 for Ii and 1 for II), might be
used as the dividing line for scoring the animals to either genotype. It can be
determined from the table that 7/10 II and 9/10 Ii are identified as the correct
genotype.
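
The scoring rule can be reproduced from the tabulated ratios. The sketch below applies the 1.5 dividing line to the 20 values in the table and counts the animals assigned to their known genotype; the names are illustrative.

```python
THRESHOLD = 1.5  # midway between the expected ratios 1 (II) and 2 (Ii)

# Non del/del peak area ratios from the table, grouped by known genotype.
RATIOS = {
    "II": [1.347, 1.21, 1.33, 2.267, 0.444, 0.713, 8.387, 0.994, 1.673, 1.056],
    "Ii": [1.751, 1.73, 1.83, 0.631, 1.975, 2.147, 1.901, 1.749, 2.103, 2.026],
}


def score(ratio: float, threshold: float = THRESHOLD) -> str:
    """Assign a genotype from the ratio using a single dividing line."""
    return "Ii" if ratio > threshold else "II"


def correct_calls(ratios_by_genotype):
    """Count, per known genotype, how many animals are scored correctly."""
    return {
        genotype: sum(score(ratio) == genotype for ratio in values)
        for genotype, values in ratios_by_genotype.items()
    }


if __name__ == "__main__":
    print(correct_calls(RATIOS))
```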



Example 16 

Sequencing of KIT cDNA clones 

mRNA was isolated from peripheral blood leukocytes from white (Landrace/Large
White) and coloured (Hampshire) pigs using the Message Maker mRNA isolation
system (Gibco BRL) with one mRNA selection from total RNA. 100 ng poly(A)+
mRNA was reverse-transcribed with random primers (First-Strand cDNA Synthesis
kit, Pharmacia Biotech) and the product was used at a 1:10 dilution for RT-PCR
using the proof-reading Advantage KlenTaq Polymerase (Clontech) according to the
manufacturer's recommendation. The following primers were used to amplify almost
the entire coding sequence and some of the 5' untranslated region: KIT40 (5'-GGC
TCT GGG GGC TCG GCT TTG C) corresponding to the 5' untranslated region and
KIT22S (5'- TCA GAG ATC TTC GTG GAG AAG GAG AGG) corresponding to
exon 21; both primers had been designed using consensus sequence of the human
and mouse KIT sequences in the GENBANK database. The RT-PCR products were
gel purified and cloned using the pGEM-T vector system (Promega). Plasmid clones
were sequenced using a set of internal primers and the ABI Prism™ dRhodamine
Terminator Cycle Sequencing Kit (PE Applied Biosystems). Two subclones
representing each type of KIT sequence were initially sequenced and in those cases
where a discrepancy was observed (possibly due to PCR errors) additional clones
were sequenced over those particular nucleotide sites. RT-PCR analysis of KIT exon
16-19 was carried out with the primers KIT1F (5'-TCR TAG ATA GAA AGA GAY
GTG ACT C) and KIT7R (5'-AGC GTT GGT TGA TGA TGT TGT AG).

Results

The sequence of the KIT gene coding region derived from an animal of the
Hampshire breed is shown in Figure 10. Differences between KIT cDNA
sequences cloned from a Hampshire and a Yorkshire/Landrace pig, respectively,
are shown in the table below. The sequence comparison includes the whole open
reading frame of 2919 bp, except for the last 27 bp occupied by the reverse PCR
primer. Exon and base pair position number as well as amino acid codon are given
for each difference. Polymorphic bases are shown in bold. A dash indicates identity
with the Hampshire (i) allele.

Example 17

DNA preparation

Genomic DNA was prepared as described in example 12.

PCR

A 158 bp fragment covering 99 bp of the 3' end of exon 19 and 59 bp of intron 19
of the KIT gene was amplified using forward primer LA93 (5'-GAG CAG CCC CTA CCC
CGG AAT GCC AGT TGA-3') and reverse primer KIT56 (5'-CTT TAA AAC AGA ACA TAA
AAG CGG AAA CAT CAT GCG AAG G-3'). PCR was carried out on a Perkin Elmer 9600
Thermal Cycler in a total volume of 20 µl containing 25 ng genomic DNA, 1.5 mM
MgCl2, 50 mM KCl, 10 mM Tris-HCl, pH 8.3, 200 µM dNTPs, 0.5 U AmpliTaq Gold
(Perkin Elmer) and 10 pmol of both LA93 and KIT56 primer. To activate AmpliTaq
Gold, initial heat denaturation was carried out at 94°C for 10 minutes followed
by 32 cycles each consisting of 45 sec at 94°C, 45 sec at 55°C and 45 sec at
72°C.

Restriction digestion and electrophoresis

The PCR amplification product is 158 bp in length. To test for polymorphism at
position 93 of this product (corresponding to position 2678 of the KIT cDNA
sequence) digestion reactions were set up and incubated as below:

6.0 µl PCR product
1.0 µl 10x reaction buffer 3 (New England Biolabs)
0.2 µl AciI (5 U/µl)
2.8 µl deionised water

Following digestion at 37°C for 120 minutes each 10 µl reaction volume had 2 µl
of loading dye added and the mix was loaded on an 8% native polyacrylamide gel
(Protogel, 37.5:1 acrylamide:bisacrylamide, National Diagnostics, Atlanta) in
0.5 x TBE (44.5 mM Tris pH 8.0, 44.5 mM boric acid and 0.5 mM EDTA) and
electrophoresed for 3 hours at 200 V in a vertical slab gel unit (SE600 Hoefer
Scientific Instruments). Products are visualised by ethidium bromide staining.

Results

The reverse primer is designed such that an AciI site is introduced into the
amplified sequence. This results in digestion of the amplicon with AciI releasing
a fragment of 23 bp that allows confirmation of the digestion process. Digestion
of the remaining 135 bp fragment into fragments of 92 and 43 bp is dependent on
the nucleotide at the position corresponding to position 2678 of the KIT cDNA
sequence. A T at this position prevents digestion while a C at this position
allows digestion. Gel resolution is not sufficient to allow resolution of the
23 bp fragment but comparison to undigested product allows confirmation of the
process.
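
The expected AciI banding pattern for each genotype at position 2678 can be sketched as follows. The function and constant names are illustrative; the fragment sizes (23 bp control fragment, 135 bp uncut remainder, 92 bp + 43 bp cut remainder) are those given above, and the heterozygote is assumed to show the union of both patterns.

```python
AMPLICON_BP = 158

# Fragments produced from ONE template, keyed by the base at position 2678.
FRAGMENTS_BY_BASE = {
    "C": [23, 43, 92],  # engineered control site plus the polymorphic site
    "T": [23, 135],     # engineered control site only
}


def acii_bands(genotype: str) -> list[int]:
    """Sorted band sizes (bp) expected for a genotype such as 'C/T'."""
    bands = set()
    for base in genotype.split("/"):
        fragments = FRAGMENTS_BY_BASE[base]
        # Fragments from each template must add up to the full amplicon.
        assert sum(fragments) == AMPLICON_BP
        bands.update(fragments)
    return sorted(bands)


if __name__ == "__main__":
    for genotype in ("C/C", "C/T", "T/T"):
        print(genotype, acii_bands(genotype))
```

In practice the 23 bp band would not be resolved on the gel described, so scoring rests on the 135, 92 and 43 bp bands.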



Figure 11 illustrates the results obtained with animals of a range of genotypes.

The test was used to analyse a total of 66 unrelated individuals from seven breeds
of pig. The results are shown in the table below:



Breed              No.  KIT Genotype*  Genotype at pos'n 2678
                                       C/C   C/T   T/T

Hampshire           4   i/i            1     1      2
Polish Wild Boar   13   i/i            0     1     12
Duroc              11   i/i            0     1     10
Pietrain            1   i/i            0     0      1
Swedish Wild Boar   1   i/i            1     0      0
Swedish Landrace   12   I/I            0     0     12
                    5   I/I^P          0     2      3
Swedish Yorkshire  14   I/I            0     1     13
                    5   I/I^P          0     1      4

* Genotype based on NlaIII RFLP analysis as described in example 11.
Example 18

Determination of genotype at the I locus using a rapid DNA based test

Crude DNA lysates were prepared from hair samples from animals of three breeding
lines: a Hampshire based line, a Large White line, and white animals from a cross
bred line originally produced from the two former lines. Four hair follicles were
placed into 100 µl of K buffer (50 mM KCl, 10 mM Tris-HCl pH 8.3, 2.5 mM MgCl2,
0.5% w/v Tween 20) and 1 µl Proteinase K (15 mg/ml) (Boehringer) added. This mix
was incubated for 2 hours at 55°C followed by 16 min at 95°C. DNA was also
prepared as described in example 12.

Allelic discrimination reactions were set up using the PE Applied Biosystems
TaqMan™ system. 25 µl reactions contained the primers E19FOR (5'-
GAGCAGCCCCTACCCCGGAATGCCAGTTGA-3') and E19REV (5'-
CTTTAAAACAGAACATAAAAGCGGAAACATCATGCGAAGG-3') at 300 nM,
8% glycerol (w/v), 1X TaqMan™ buffer A (PE Applied Biosystems), 5 mM MgCl2,
200 µM dATP, dGTP, dCTP and dUTP, 0.65 units AmpliTaq Gold™ (PE Applied
Biosystems), 0.25 units AmpErase™ UNG (PE Applied Biosystems) and the
TaqMan™ probes E19PC (5'- CATACATTTCCGCAGGTGCATGC-FAM) and
E19PT (5'- TCATACATTTCCACAGGTGCATGC-TET) at a concentration of
100 mM. 1 µl of crude lysate DNA was used as template. PCR amplification was
carried out using a PE9600 thermal cycler (PE Applied Biosystems) or the ABI7700
Prism (PE Applied Biosystems) with a thermal cycling regime of 50°C for 2 min
followed by 95°C for 10 min followed by 40 cycles of 95°C for 15 sec and 62°C for
1 min. 8 control samples of each homozygote genotype, 2678C and 2678T, and 8 no
template controls, in which deionized water was substituted for template, were
used per 96 well plate. Allele identification based on these reactions was
carried out using the allelic discrimination function of the ABI7700 Prism (PE
Applied Biosystems).

Results 

The test was used to analyse a total of 20 unrelated individuals from four breeds
of pig. The results are shown in the table below:

Breed      No.          Assumed KIT  Genotype at pos'n 2678
                        Genotype     C/C          C/T          T/T

Hampshire  [illegible]  i/i          [illegible]  [illegible]  3
Landrace   5            I/I          0            0            5
Duroc      5            i/i          0            0            5
Pietrain   5            i/i          0            0            5



Example 19 



Complete cosegregation of the Belt coat colour locus and KIT 



Method 



Hampshire pigs have a characteristic coat colour phenotype with a white belt on a
solid black background. Belt is determined by a dominant allele (Be). The
segregation of the Belt locus was investigated in a backcross between Hampshire
(Be/Be) and Pietrain (be/be) pigs. F1 sows (Be/be) were back-crossed to pure-bred
Pietrain (be/be) boars. DNA preparations were carried out exactly as described in
Example 3.

KIT exon 19 PCR RFLP

i) PCR to produce DNA for the RFLP test

A 158 bp fragment covering 99 bp of the 3' end of exon 19 and 59 bp of intron 19
of the KIT gene was amplified using the following primers:

forward LA93 (5' - GAGCAGCCCCTACCCCGGAATGCCAGTTGA - 3') and
reverse KIT56 (5' - CTTTAAAACAGAACATAAAAGCGGAAACATCATGCGAAGG - 3').

PCR was carried out in a total volume of 20 µl containing 25 ng genomic DNA,
1.5 mM MgCl2, 50 mM KCl, 10 mM Tris-HCl, pH 8.3, 200 µM dNTPs, 0.5 U AmpliTaq
Gold (Perkin Elmer) and 10 pmol of both LA93 and KIT56 primer. To activate
AmpliTaq Gold, initial heat denaturation was carried out at 94°C for 10 minutes
followed by 32 cycles each consisting of 45 sec at 94°C, 45 sec at 55°C and
45 sec at 72°C.

ii) Restriction enzyme digestion and electrophoresis

The PCR amplification product is 158 bp in length. To test for polymorphism at
position 93 of this product, digestion reactions were set up and incubated as
follows:

6.0 µl PCR amplified DNA
1.0 µl 10 X NEBuffer 3
0.2 µl AciI (5 U/µl)
2.8 µl dH2O

(1 X NEBuffer 3 (New England Biolabs) contains 100 mM sodium chloride, 50 mM
Tris-HCl, 10 mM magnesium chloride, and 1 mM DTT). Following digestion at 37°C
for 120 minutes, two µl loading dye was added to each sample and the mix loaded
on a 12% native polyacrylamide gel in 0.5 X TBE (44.5 mM Tris pH 8.0, 44.5 mM
boric acid and 0.5 mM EDTA) and electrophoresed for 3 hours at 200 V in a
vertical slab unit. Products were visualised by ethidium bromide staining.

Results 

KIT nucleotide 2678 is polymorphic and a C or T occurs at this position. The
presence of a C creates a restriction site for AciI which is absent when a T is
present. A second AciI site has been engineered into the reverse primer KIT56 to
serve as an internal control of digestion and is therefore invariant. The
polymorphism can be detected by a simple PCR-RFLP analysis as described in the
table below.

Detection of KIT single nucleotide polymorphism (SNP) at position 2678

Nucleotide  Size in bp of DNA
            fragments after digestion

C           23 + 43 + 92
T           23 + 135

The cosegregation of the Belt and KIT loci in this pedigree is summarised in the
table below.

Cosegregation between KIT and Belt in a Hampshire/Pietrain backcross

Animal          No. tested  Phenotype  Belt locus  KIT SNP2678

F1 sows         14          Belt       Be/be       C/T
Pietrain sires   2          non-Belt   be/be       T/T
Offspring       41          Belt       Be/be       C/T
Offspring       41          non-Belt   be/be       T/T



The complete cosegregation between the Belt phenotype and the KIT polymorphism
shows that this phenotype is most likely controlled by a mutation at the KIT
locus. This means that detection of KIT polymorphism can be used to identify
animal products derived from Hampshire pigs, since the Belt is the most important
breed determinant in Hampshire pigs. It is likely that the Belt phenotype present
in Saddleback and Hannover-Braunschweig pigs is controlled by the same locus.

Example 20 

Determination of the sequence of the 5' untranslated and 5' coding region of the
αMSHR gene

The entire coding region of the αMSHR gene was determined and compared between
pig breeds known to carry different alleles at the E locus, [allele symbols
illegible] and E^P. Hampshire carries [illegible] and has a solid black body
interrupted with a white belt. This belt is the result of another coat colour
locus. The Wild Boar, which carries allele [illegible], has a wildtype phenotype,
while the Pietrain breed carries allele E^P and is characterized by having black
spots on a white body.

PCR to produce DNA for clone construction 

The entire coding region of the αMSHR gene was amplified from genomic DNA
using primers EPIG10 and EPIG16. These primers have sequence:
EPIG10 5' - GOT CTA GAT CAC CAG GAG CAC TGC AGC ACC - 3'
EPIG16 5' - GGG AAG CTT GAC CCC CGA GAG CGA CGC GCC - 3'
PCR was carried out on a DNA thermal cycler (Perkin Elmer 9600) in a total volume
of 20 µl containing 25 ng genomic DNA, 1.5 mM MgCl2, 50 mM KCl, 10 mM Tris-HCl,
pH 8.3, 200 µM dNTPs, 5.0 % DMSO (dimethyl sulfoxide), 0.5 U AmpliTaq
Gold (Perkin Elmer) and 10 pmol of both EPIG10 and EPIG16. To activate AmpliTaq
Gold, initial heat denaturation was carried out at 96°C for 10 minutes followed by 32
cycles each consisting of 45 sec at 94°C, 45 sec at 55°C and 45 sec at 72°C. The final
extension lasted for 7 min at 72°C.
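
The cycling conditions above can be written down as a small data structure, which makes it easy to check the programmed run time. This is an illustrative sketch; the variable and function names are assumptions, and ramping time between temperatures is ignored.

```python
# (label, seconds, temperature in deg C) for each step, as described above.
INITIAL = ("activation", 10 * 60, 96)
CYCLE = [("denature", 45, 94), ("anneal", 45, 55), ("extend", 45, 72)]
N_CYCLES = 32
FINAL = ("final extension", 7 * 60, 72)

def programmed_seconds():
    """Total hold time in seconds, ignoring ramping between temperatures."""
    per_cycle = sum(seconds for _, seconds, _ in CYCLE)
    return INITIAL[1] + N_CYCLES * per_cycle + FINAL[1]

total = programmed_seconds()
print(f"{total} s = {total / 60:.0f} min")
```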

Cloning of PCR products 

To facilitate cloning of PCR products, both primers were designed with restriction
endonuclease recognition sites located at the 5' end. Primer EPIG10 has sequence
TCTAGA which is cut using enzyme XbaI and EPIG16 contains sequence AAGCTT
which is cut using enzyme HindIII. Following PCR as described above, the entire
reaction volume was electrophoresed and purified using the Qiaex II gel extraction kit
following the manufacturer's instructions (Qiagen). The purified PCR product was
digested prior to ligation as follows:

PCR product                               17.0 µl
5.0 U HindIII (Amersham)                   1.0 µl
5.0 U XbaI (Amersham)                      1.0 µl
x10 reaction buffer M (Amersham)           3.0 µl
x10 bovine serum albumin (Amersham)        3.0 µl
H2O                                        5.0 µl
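
As a quick arithmetic check on the digestion mix, the component volumes can be summed and the final strength of the x10 stocks confirmed. The sketch below simply restates the volumes listed above; the dictionary keys are illustrative labels.

```python
# Component volumes in microlitres, as listed in the digestion mix above.
digest = {
    "PCR product": 17.0,
    "HindIII": 1.0,
    "XbaI": 1.0,
    "x10 reaction buffer M": 3.0,
    "x10 bovine serum albumin": 3.0,
    "H2O": 5.0,
}

total = sum(digest.values())
# A x10 stock is at x1 when it makes up one tenth of the final volume.
buffer_fold = 10 * digest["x10 reaction buffer M"] / total
bsa_fold = 10 * digest["x10 bovine serum albumin"] / total
print(total, buffer_fold, bsa_fold)
```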



The reactions were incubated at 37 degrees C for 16 hours before the digested DNA
was purified by passage through a QIAquick spin column following the
manufacturer's instructions (Qiagen). PCR products were ligated into 100 ng of vector
pRc/CMV (Invitrogen) using 400 U T4 DNA ligase (New England Biolabs) in a total
reaction volume of 20 µl containing 10 mM MgCl2, 50 mM Tris-HCl, pH 7.5, 10 mM
dithiothreitol, 1 mM ATP and 25 µg/ml bovine serum albumin. Ligation reactions
proceeded at 16 degrees C for 16 hours.
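
The position of the cloning sites within the two amplification primers can be confirmed with a plain string search. This is a sketch for illustration only: the second base of the XbaI-site primer is not legible in the text above and is written here as N, which is an assumption, and the variable names are hypothetical.

```python
# Primer sequences as printed above, written 5' to 3' without spaces.
# The unreadable second base of the first primer is shown as N (assumption).
xbai_primer = "GNTCTAGATCACCAGGAGCACTGCAGCACC"
hindiii_primer = "GGGAAGCTTGACCCCCGAGAGCGACGCGCC"

SITES = {"XbaI": "TCTAGA", "HindIII": "AAGCTT"}

def site_offset(primer, site):
    """0-based offset of the recognition site in the primer, or -1."""
    return primer.find(site)

print(site_offset(xbai_primer, SITES["XbaI"]))
print(site_offset(hindiii_primer, SITES["HindIII"]))
```

Both sites sit within the first ten bases of the primers, consistent with their description as being located at the 5' end.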

Preparation and Sequencing of Plasmid DNA 

Plasmid DNA was purified from overnight bacterial culture using the Jetstar plasmid
midi kit 50 (Genomed) and the resulting DNA diluted to 15 ng/µl. Dye terminator
cycle sequencing was performed using AmpliTaq DNA polymerase in accordance
with the ABI Prism protocol P/N 402078 (Perkin Elmer). Cycle sequencing reactions
comprised 1.6 pmole of either T7 or SP6 sequencing primer (Promega), 15 ng of
plasmid DNA and the terminator ready reaction mix (Perkin Elmer). The cycle
sequencing reactions were performed in a GeneAmp 9600 machine over 25 cycles,
each consisting of 10 sec at 96°C, 5 sec at 50°C and 4 min at 60°C. Extension
products were purified for gel separation using ethanol precipitation, loaded and run
on a 377 ABI Prism DNA sequencer as described by the instrument protocol (Perkin
Elmer protocol P/N 402178).

Results 

A 2 bp insertion was identified in the αMSHR gene of pigs of the Pietrain breed which
carry the allele between nucleotide positions equivalent to 66 and 67 in the Wild
Boar αMSHR sequence. This results in a shift in the translation frame and creates a
TGA stop codon at nucleotide positions equivalent to 161 to 163 in the Wild Boar
αMSHR sequence. The 5' portion of the αMSHR coding sequence compared
between three breeds is shown below. This comparison illustrates the two base pair
insertion present within the alleles carried by the Pietrain animal when compared with
either the Hampshire or Wild Boar alleles. The ATG start codon is highlighted in
bold, the 3' end of primer EPIG16 is shown in italics and bases in common with the
Pietrain sequence are marked with a dash. Missing bases are marked with :.



Pietrain    CGACGCGCCC TCCCTGCTCC CTGGCGGGAC GATGCCTGTG CTTGGCCCGG
Meishan
Wild Boar

Pietrain    AGAGGAGGCT GCTGGCTTCC CTCAGCTCCG CGCCCCCAGC CGCCCCCCCC
Meishan
Wild Boar

Pietrain    GGCCGCCAAC GCCTCGGGCT CAGACCAACC AGACGGGCCC CCAGTGCCTG
Meishan
Wild Boar

Pietrain    GAGGTGTCCA TT
Meishan
Wild Boar


These results are also incorporated into Figure 1a.
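
The consequence of the insertion can be checked directly against the Wild Boar coding sequence given in FIG. 1a. The sketch below is illustrative only: it assumes that the inserted bases are CC, as suggested by the bases shown for Pietrain in FIG. 1a, and the function and variable names are hypothetical.

```python
# Wild Boar coding sequence, positions 1-177, copied from FIG. 1a.
WILD_BOAR = (
    "ATG CCT GTG CTT GGC CCG GAG AGG AGG "
    "CTG CTG GCT TCC CTC AGC TCC GCG CCC CCA GCC GCC CCC CGG CCG CCA "
    "ACG CCT CGG GCT CAG ACC AAC CAG ACG GGC CCC CAG TGC CTG GAG GTG TCC "
    "ATT CCC GAC GGG CTC TTC CTC AGC CTG GGG CTG GTG AGC CTC GTG GAG AAC"
).replace(" ", "")

STOPS = {"TAA", "TAG", "TGA"}
INSERT_AFTER = 66   # the insertion lies between positions 66 and 67
INSERTION = "CC"    # assumed from the bases shown for Pietrain in FIG. 1a

def first_stop(seq):
    """0-based index of the first in-frame stop codon, or None."""
    for i in range(0, len(seq) - 2, 3):
        if seq[i:i + 3] in STOPS:
            return i
    return None

pietrain = WILD_BOAR[:INSERT_AFTER] + INSERTION + WILD_BOAR[INSERT_AFTER:]
stop = first_stop(pietrain)
# Convert back to 1-based Wild Boar numbering (subtract the inserted bases).
wild_start = stop - len(INSERTION) + 1
print(first_stop(WILD_BOAR), pietrain[stop:stop + 3], wild_start, wild_start + 2)
```

With these inputs the first in-frame stop in the shifted reading frame falls at the positions stated in the text, while the unmodified Wild Boar sequence has no in-frame stop within the region shown.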



Example 21

A rapid DNA test for the presence of the 2 bp insertion mutation in the porcine
αMSHR gene allowing rapid distinction of the EF allele from all other alleles
identified at this locus

PCR was conducted with forward primer EPIG16 (see above) and reverse primer
MC1R121A exactly as described above. The reverse primer was labeled with ABI
dye Hex and has sequence:

MC1R121A 5' - Hex - GGA CTC CAT GGA GCC GCA GAT GAG CAC GGT - 3'

Following PCR cycling, 0.2 µl of the reaction volume was mixed with 2.5 µl of
deionised formamide, 0.5 µl of GS500 DNA standard (ABI) and 0.4 µl blue dextran
solution before being heated to 90°C for 2 minutes and rapidly cooled on ice. 1 µl of
this mix was then loaded onto a 377 ABI Prism sequencer and the DNA fragments
separated on a 6% polyacrylamide gel in 1 X TBE buffer for 2 hours at 700 V, 40
mA, 32 W. The lengths of the resulting PCR products were determined using the
GeneScan software (ABI).

Results 

A test was devised to assay genomic DNA directly for the presence of the identified 2
bp insertion. Primers EPIG16 and MC1R121A were used to PCR amplify 448 bp of
the 5' portion of the wildtype porcine MC1R gene. To facilitate fluorescent detection
of amplified products, the ABI dye HEX was covalently attached to the 5' end of
primer MC1R121A. PCR was conducted on a number of unrelated individuals from
three breeds and the resulting PCR product sizes determined using the GeneScan
software (ABI). The results are presented in the table below:

Breed          Number    Extension    Size of PCR product
               tested    Genotype     448 bp    450 bp

Pietrain       5         EF/EF        0         5
Large White    3         EF/EF        0         3
Hampshire      5                      5         0

Analysis of the length of PCR products amplified between individuals showed that either a
448 bp or a 450 bp product resulted. The expected 448 bp fragment was amplified from
each Hampshire animal, however a product 2 bp longer was detected from each
Pietrain and Large White animal tested. This indicates the 2 bp insertion identified via
sequence analysis to be present in the genomic DNA of Pietrain and Large White, two
breeds ascribed the EF allele, but not in Hampshire which carries
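
The size-based test lends itself to a simple automated call. The following is a minimal sketch under stated assumptions: peak sizes are taken as already exported from the fragment analysis software, rounding to the nearest base pair is assumed to be adequate, and the function name and return labels are hypothetical.

```python
WILD_TYPE_BP = 448   # expected product without the insertion
INSERTION_BP = 450   # product carrying the 2 bp insertion

def call_insertion(peaks_bp):
    """Classify a sample from its fragment-analysis peak sizes (bp).

    Returns "no insertion", "insertion" or "heterozygous"; raises
    ValueError if neither expected product is present.
    """
    sizes = {round(p) for p in peaks_bp}
    has_wt = WILD_TYPE_BP in sizes
    has_ins = INSERTION_BP in sizes
    if has_wt and has_ins:
        return "heterozygous"
    if has_ins:
        return "insertion"
    if has_wt:
        return "no insertion"
    raise ValueError("no expected product detected")

print(call_insertion([450.1]))  # result of the kind seen for Pietrain
print(call_insertion([447.9]))  # result of the kind seen for Hampshire
```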

EXAMPLE 22 

In addition to the coding region of the αMSHR gene, DNA sequence polymorphism
may exist between breeds within the untranslated regions (UTR). Sequence
information was collected from the 3' UTR and compared between six breeds of pigs
which display a variety of coat color phenotypes.

PCR to produce DNA for sequencing 

A 454 bp product containing 38 coding nucleotides from the 3' portion of the
molecule and 416 bp of 3' untranslated region (not including primer binding sites)
was amplified using primers EPIG13 and EPIG14. These primers have sequence:
EPIG13 5' - OCA AGA CCC TCC AGG AGG TG - 3'
EPIG14 5' - CAC TGA GCC GTA GAA GAG AG - 3'

PCR was carried out on a DNA thermal cycler (Perkin Elmer 9600) in a total volume
of 20 µl containing 25 ng genomic DNA, 1.5 mM MgCl2, 50 mM KCl, 10 mM Tris-HCl,
pH 8.3, 200 µM dNTPs, 0.5 U AmpliTaq Gold (Perkin Elmer) and 10 pmol of
both EPIG13 and EPIG14. To activate AmpliTaq Gold, initial heat denaturation was
carried out at 96°C for 10 minutes followed by 32 cycles each consisting of 45 sec at
94°C, 45 sec at 55°C and 45 sec at 72°C. The final extension lasted for 7 min at 72°C.

Sequencing of PCR products 

PCR products were sequenced using dye terminator chemistry. This first requires
purification of PCR product free from excess dNTPs and primers. This was achieved
by passage of the template DNA through a QIAquick spin column following the
manufacturer's instructions (Qiagen). Cycle sequencing reactions were performed on
20 ng of purified template using either EPIG13 or EPIG14 as described in Example 1.

Results 

Primers EPIG13 and EPIG14 were used to amplify a 454 bp region of DNA which
comprises both the 3' terminal coding region of the MC1R gene and the immediately
adjacent 3' UTR. The sequence information collected is displayed in figure 15 and
two polymorphic positions were identified.

The first polymorphism identified is a 1 bp deletion common to Meishan and Large
Black which occurs seven positions downstream from the stop codon at the equivalent
of nucleotide position 1007 in the Wild Boar sequence. As this deletion occurs outside
the translated region it is not expected to alter the amino acid composition of the
resulting receptor molecule; however, its influence on mRNA stability and 3' end
formation through endonucleolytic cleavage is unknown. It is unique to two breeds
which carry and which carry a variant of the αMSHR gene not found in any other
breed examined to date.

The second polymorphism is a base substitution at nucleotide position 1162 unique to
the European Wild Boar. Figure 15 shows five breeds to have a G base at this position
and the Wild Boar to contain an A. This sequence difference offers the possibility to
distinguish the European Wild Boar from the other breeds analysed with a DNA based
test.
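
The two 3' UTR polymorphisms described above can be combined into a simple grouping rule. This is a sketch for illustration, not a validated breed test: the function name and group labels are hypothetical, and only the breeds examined in this Example are considered.

```python
def classify_3utr(deletion_at_1007, base_at_1162):
    """Group a sample by the two 3' UTR polymorphisms described above.

    deletion_at_1007: True if the 1 bp deletion is present.
    base_at_1162: "A" or "G".
    The returned labels are illustrative groupings, not breed assignments.
    """
    if base_at_1162 not in ("A", "G"):
        raise ValueError("expected A or G at position 1162")
    if base_at_1162 == "A":
        return "Wild Boar type"
    if deletion_at_1007:
        return "Meishan / Large Black type"
    return "other breeds examined"

print(classify_3utr(False, "A"))
print(classify_3utr(True, "G"))
print(classify_3utr(False, "G"))
```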



CLAIMS:

1. A method for:

(a) differentiating animals and animal products on the basis of breed origin; or 

(b) determining or testing the breed origin of an animal product; or 

(c) validating an animal product; 
comprising the steps of: 

(i) providing a sample of the animal product; and 

(ii) analysing the allele(s) of one or more breed determinant genes present in 
the sample. 

2. The method of claim 1 wherein the breed determinant is a monogenic trait.

3. The method of claim 1 wherein the breed determinant is a polygenic trait. 

4. The method of any one of claims 1-3 wherein the overt phenotypic trait is a 
behavioural or morphological trait. 

5. The method of claim 3 or claim 4 wherein the overt phenotypic trait varies 
qualitatively or quantitatively between breeds. 

6. The method of any one of the preceding claims wherein the breed determinant 
gene analysed in step (ii) is selected from any of: 

(a) a coat colour gene; and/or 

(b) a coat pattern gene; and/or 

(c) a coat texture gene; and/or 

(d) a coat density gene; and/or 

(e) a coat length gene; and/or 

(f) an ear aspect gene; and/or 

(g) a double muscling gene; and/or 

(h) a horn morphology gene; and/or 

(i) a tusk morphology gene; and/or 
(j) an eye colour gene; and/or

(k) a plumage gene; and/or

(l) a beak colour/morphology gene; and/or

(m) a vocalization (e.g. barking) gene; and/or

(n) a comb or wattle gene; and/or

(o) a gene controlling display behaviour.

7. The method of claim 6(a) wherein the coat colour gene is the KIT or αMSHR gene
(for example, the pig KIT or αMSHR gene).

8. The method of any one of the preceding claims wherein the sample is a nucleic
acid sample and the analysing step (ii) comprises DNA or RNA analysis.

9. The method of any one of claims 1-7 wherein the sample is a protein sample and 
the analysing step (ii) comprises protein analysis. 

10. A method of determining the coat colour genotype of a pig which comprises:

(i) obtaining a sample of pig nucleic acid; and

(ii) analysing the nucleic acid obtained in (i) to determine which allele or
alleles of the αMSHR gene is/are present.

11. The method of claim 8 or claim 10 wherein the analysis step (ii) comprises:

(a) selectively amplifying a specific fragment of nucleic acid (e.g. by PCR);
and/or

(b) testing for the presence of one or more restriction endonuclease sites
within the breed determinant gene(s)/αMSHR gene (e.g. restriction fragment
length polymorphism (RFLP) analysis); and/or

(c) determining the nucleotide sequence of all or a portion of the breed
determinant gene(s)/αMSHR gene; and/or

(d) probing the nucleic acid sample with an allele-specific DNA or RNA
probe; and/or

(e) carrying out one or more PCR amplification cycles of the nucleic acid
sample using at least one pair of suitable primers and then carrying out RFLP
analysis on the amplified nucleic acid so obtained.

12. A method of determining the coat colour genotype of a pig which comprises: 

(i) obtaining a sample of pig αMSHR protein; and

(ii) analysing the protein obtained in step (i) to determine the amino acid
sequence at those positions associated with coat colour genotype or the size of the
protein.

13. The method of claim 9 or claim 12 wherein the analysis step (ii) comprises: 

(a) probing the protein sample with an antibody (e.g. a monoclonal antibody) 
specific for an allele-specific epitope; and/or 

(b) electrophoretic analysis; and/or 

(c) chromatographic analysis; and/or 

(d) amino-acid sequence analysis; and/or 

(e) proteolytic cleavage analysis; and/or 

(f) epitope mapping; and/or

(g) translating a copy of the DNA or RNA of the gene produced by PCR or
other means in an in-vitro transcription/translation system.

14. The method of claim 7 wherein the analysis step (ii) comprises determining the
nucleotide sequence of the KIT or αMSHR gene or the amino acid sequence of the
KIT or αMSHR protein.

15. The method of claim 7 or claim 14 wherein the analysis step (ii) comprises
establishing the presence or absence of at least one nucleotide change in the KIT or
αMSHR gene and/or their flanking regions.

16. The method of claim 10 or claim 11 wherein the determination in step (ii)
involves identifying the presence or absence of at least one missense mutation,
insertion or deletion in the αMSHR gene and/or its associated flanking regions.



17. The method of any one of claims 7, 10, 11, 14 and 15 wherein the analysis step
(ii) further comprises determining the association between one or more microsatellite
or other linked marker alleles linked to the KIT or αMSHR gene and to particular
alleles of the KIT or αMSHR gene.

18. The method of claim 17 wherein the analysis step (ii) is based on the 
identification of microsatellite markers present in the nucleic acid sample. 

19. The method of claim 7 wherein the analysis step (ii) comprises:

(a) determining the association between one or more microsatellite or other
linked marker alleles linked to the KIT or αMSHR gene and to particular
alleles of the KIT or αMSHR gene;

(b) determining which microsatellite or other linked marker allele or alleles
are present in the nucleic acid sample.

20. A method of determining the coat colour genotype of a pig which comprises:

(i) determining the association between one or more microsatellite or other
linked marker alleles linked to the αMSHR gene and particular alleles of the
αMSHR gene;

(ii) obtaining a sample of pig nucleic acid; and

(iii) analysing the nucleic acid obtained in (ii) to determine which
microsatellite or other linked marker allele or alleles are present.

21. The method of any one of claims 7, 10, 11 and 14-20 wherein the analysis step
(ii) further comprises the step of determining the genotype of at least one additional
locus.

22. The method of claim 21 wherein the additional locus is an additional coat colour
locus.

23. The method of claim 22 wherein the additional coat colour locus is the KIT gene
locus (e.g. the pig KIT gene locus).

24. The method of claim 23 wherein the KIT gene locus is analysed to determine
whether it carries any polymorphism associated with Belt genotype.

25. The method of claim 24 wherein the determination comprises RFLP analysis.

26. A method of determining the coat colour genotype of a pig which comprises:

(i) obtaining a sample of pig nucleic acid; and

(ii) analysing the nucleic acid obtained in (i) to determine whether the KIT
gene carries any polymorphism associated with Belt genotype.

27. A method as claimed in claim 26 wherein step (ii) comprises RFLP analysis.

28. A method as claimed in claim 26 or claim 27 wherein a sample of pig genomic
DNA is amplified using PCR and a pair of suitable primers.

29. The method of claim 21 wherein the additional locus is a breed determinant
gene locus selected from any of those genes specified in claim 6.

30. The method of claim 21 wherein the additional locus is a breed specific marker. 

31. The method of claim 30 wherein the breed specific marker is a microsatellite
marker.

32. The method of any one of claims 7, 10, 11, 14-23 and 28-31 wherein the analysis
step (ii) comprises PCR using at least one pair of suitable primers.

33. The method of claim 32 wherein the gene is the pig αMSHR gene and at least one
pair of suitable primers is:

αMSHR Forward Primer 1: (5'-TGT AAA ACG ACG GCC AGT RGT GCC TGG
AGG TGT-3')

αMSHR Reverse Primer 5: (5'-CGC CCA GAT GGC CGC GAT GGA CCG-3'); or

αMSHR Forward Primer 2: (5'-CGG CCA TCT GGG CGG GCA GCG TGC-3')
αMSHR Reverse Primer 2: (5'-GGA AGG CGT AGA TGA GGG GGT CCA-3'); or
αMSHR Forward Primer 3: (5'-GCA CAT CGC CCG GCT CCA CAA GAC-3')
αMSHR Reverse Primer 3: (5'-GGG GCA GAG GAC GAC GAG GGA GAG-3').

34. The method of any one of claims 1, 10, 11, 14-23 and 28-33 wherein the analysis
step (ii) comprises restriction fragment length polymorphism (RFLP) analysis, for
example involving digesting the pig nucleic acid with one or more of the restriction
enzymes BstUI, HhaI and/or BspHI.

35. The method of claim 34 wherein the gene is the pig αMSHR gene and the
analysis involves identification of a polymorphism at nucleotide position 283, 305,
363, 370, 491, 727, 729, 1162 or between nucleotide positions 60 and 70 or between
nucleotide positions 1005 and 1010 of the sequence of the pig αMSHR gene.

36. The method of claim 7 wherein the analysis step (ii) comprises carrying out one or more
PCR amplification cycles of the nucleic acid sample using at least one pair of suitable
primers and then carrying out RFLP analysis on the amplified nucleic acid so
obtained to determine the KIT or αMSHR genotype of the pig.

37. The method of claim 36 wherein the gene is the pig αMSHR gene and the at least
one pair of suitable primers is as defined in claim 35.

38. The method of claim 30 or 31 wherein the gene is the pig KIT or αMSHR gene
and the RFLP analysis is as defined in claim 28.

39. The method of any one of claims 1-9, 11, 13-19, 21-25 and 29-38 wherein the
animal product is meat (e.g. processed and/or canned meat), egg, egg swab or
washing, semen, wool or leather.

40. The method of any one of claims 1-9, 11, 13-19 and 21-39 wherein the sample
comprises genomic DNA, RNA or mitochondrial DNA.

41. The method of any one of claims 1-9, 11, 13-19 and 21-40 wherein the animal is
a mammal (e.g. pig, cattle, dog, cat, horse, sheep, rodent or rabbit), fish (e.g. salmon
or trout) or bird (chicken or turkey).

42. A kit for: 

(a) differentiating animal products on the basis of breed origin; or 

(b) determining or testing the breed origin of an animal product; or 

(c) validating an animal product; 

comprising one or more reagents for analysing the allele(s) of one or more breed 
determinant genes present in the sample. 

43. A kit for determining the coat colour genotype of a pig, comprising one or
more reagents for analysing the αMSHR genotype of the pig.

44. A kit as claimed in claim 42 or claim 43 which is adapted to be used with a
sample of pig genomic DNA.

45. A kit as claimed in any one of claims 42 to 44 comprising one or more reagents
for carrying out at least one cycle of PCR together with at least one pair of suitable
primers.

46. A kit as claimed in claim 45 wherein the at least one pair of suitable primers is:

αMSHR Forward Primer 1: (5'-TGT AAA ACG ACG GCC AGT RGT GCC TGG
AGG TGT CCA T-3')
αMSHR Reverse Primer 5: (5'-CGC CCA GAT GGC CGC GAT GGA CCG-3'); or
αMSHR Forward Primer 2: (5'-CGG CCA TCT GGG CGG GCA GCG TGC-3')
αMSHR Reverse Primer 2: (5'-GGA AGG CGT AGA TGA GGG GGT CCA-3'); or
αMSHR Forward Primer 3: (5'-GCA CAT CGC CCG GCT CCA CAA GAC-3')
αMSHR Reverse Primer 3: (5'-GGG GCA GAG GAC GAC GAG GGA GAG-3').



47. A kit as claimed in any one of claims 42 to 46 which comprises one or more
reagents for RFLP analysis of pig nucleic acid.

Abstract

Methods for Analyzing Animal Products

The present invention provides methods for analysing
animal products. In particular, methods for
differentiating animal products on the basis of breed
origin or validating an animal product are provided.

FIG. 1

a: Nucleotide sequence 



1 27 
Wild Boar CTCCCTGCTCCCTGCTCCCTGGCGGGACG ATG CCT GTG CTT GGC CCG GAG AGG AGG

Meishan
Pietrain

Wild Boar CTG CTG GCT TCC CTC AGC TCC GCG CCC CCA GCC GCC CCC ** CGG CCG CCA




Meishan 
Pietrain 



CC 



126 

Wild Boar ACG CCT CGG GCT CAG ACC AAC CAG ACG GGC CCC CAG TGC CTG GAG GTG TCC 



Meishan 
Pietrain 



177 

Wildboar ATT CCC GAC GGG CTC TTC CTC AGC CTG GGG CTG GTG AGC CTC GTG GAG AAC 

Meishan 

Pietrain 

Largewhite 

Hampshire 

Duroc

Wildboar GTG CTG GTG GTG GCC GCC ATC GCC AAG AAC CGC AAC CTG CAC TCG CCC ATG
Meishan
Pietrain
Largewhite
Hampshire
Duroc

Wildboar TAC TAC TTC GTC TGC TGC CTG GCC GTG TCG GAC CTG CTG GTG AGC GTG AGC
Meishan
Pietrain
Largewhite
Hampshire
Duroc

Wildboar AAC GTG CTG GAG ACG GCC GTG CTG CTG CTG CTG GAG GCG GGC GCC CTG GCC
Meishan      A                            C
Pietrain
Largewhite
Hampshire
Duroc

Wildboar GCC CAG GCC GCC GTG GTG CAG CAG CTG GAC AAT GTC ATG GAC GTG CTC ATC
Meishan
Pietrain
Largewhite
Hampshire

Duroc

CCC CGC GCG GGG GTC

Wildboar ACG CAG CAC CCC ACC CGC CAG GGC TGC GGC CTC AAG GGC GCG GCC ACC CTC

789

Wildboar ACC ATC CTG CTG GGC GTC TTC CTC CTC TGC TGG GCA CCC TTC TTC CTG CAC
Meishan
Pietrain
Largewhite
Hampshire

Duroc

Wildboar CTC TCC CTC GTC GTC CTC TGC CCC CAG CAC CCC ACC TGC GGC TGC GTC TTC

Wildboar AAG AAC GTC AAC CTC TTT CTG GCC CTC GTC ATC TGC AAC TCC ATC
Pietrain

b. Amino acid sequence

Wildboar ?PNGLFLSLG LVSLVENVLV VAAIAKNRNL HSPMYYFVCC LAVSDLLVSV SNVLETAVLL
Meishan                                                           M      P

Wildboar LLEAGALAAQ AAVVQQLDNV MDVLICGSMV SSLCFLGAIA VDRYVSIFYA LRYHSIVTLP
Meishan

Meishan

Wildboar HIARLHKTQH PTRQGCGLKG AATLTILLGV FLLCWAPFFL HLSLVVLCPQ HPTCGCVFKN

Sequence Alignment Across the Exon/Intron Border of KIT Exon 17

Exon 17 Intron 17

KIT1 AAT TAC GTG GTC AAA GGA AAC | GTG AGT ACC CAC GCT CTC CTG ACA GTC
KIT2 ...|A
KIT1 |G
KIT1 |G
KIT1 |G..

FIG. 5

FIG. 6C

Ratio of normal to splice mutant KIT

Ratios for Landrace and Large White

Landrace Breed Large White

FIG. 8

1

ATG AGA GGC GCT CGC CGC GCC TGG GAT TTT CTC TTC GTC CTG GAG CTC TTG 
52 

CTT CGC GTC CAG ACA GGC TCT TCT CAG CCA TCT GTG AGT CCA GAG GAA CTG 
103 

TCT CCA CCA TCC ATC CAT CCA GCA AAA TCA GAG TTA ATC GTC AGT GCT GGC 
154 

GAT GAG ATT AGG CTG TTC TGC ACC GAT CCA GGA TCT GTC AAA TGG ACT TTT 
205 

GAG ACC CTG GGT CAG CTG AGT GAG AAT ACA CAC GCA GAG TGG ATC GTG GAG 
256 

AAA GCA GAG GCC ATG AAT ACA GGC AAT TAT ACA TGC ACC AAT GAA GGC GGT 
307 

TTA AGC AGT TCC ATT TAT GTG TTT GTT AGA GAT CCT GAG AAG CTT TTC CTC 
358 

GTC GAC CCT CCC TTG TAT GGG AAG GAG GAC AAT GAC GCG CTG GTC CGA TGT 
409 

CCT CTG ACG GAC CCA GAG GTG ACC AAT TAC TCC CTC ACG GGC TGC GAG GGG 
460 

AAA CCC CTT CCC AAG GAT TTG ACC TTC GTC GCG GAC CCC AAG GCC GGC ATC 

511 

ACC ATC AGA AAC GTG AAG CGC GAG TAT CAT CGG CTC TGT CTC CAC TGC TCC 
562 

GCC AAC CAG GGG GGC AAG TCC GTG CTG TCG AAG AAA TTC ACC CTG AAA GTG 
613 

AGG GCA GCC ATC AGA GCT GTA CCT GTT GTG GCT GTG TCC AAA GCA AGC TAC 
664 

CTT CTC AGG GAA GGG GAG GAA TTT GCC GTG ATG TGC TTG ATC AAA GAC GTG 
715 

TCT AGT TCC GTG GAC TCC ATG TGG ATC AGG GAG AAC AGC CAG ACT AAA GCA 
766 

CAG GTG AAG AGG AAT AGC TGG CAT CAG GGT GAC TTC AAT TTT CTG CGG CAG 
817 

GAA AGG CTG ACA ATC AGC TCA GCA AGA GTT AAT GAT TCT GGC GTG TTC ATG 
868 

TGT TAC GCC AAT AAT ACT TTT GGA TCT GCA AAT GTC ACA ACC ACC TTA GAA 
919 

GTA GTA GAT AAA GGA TTC ATT AAT ATC TTC CCT ATG ATG AAT ACC ACT GTG 
970 

TTT GTA AAC GAT GGA GAG GAT GTG GAT CTA ATT GTT GAG TAC GAG GCG TAC 
1021 

CCC AAA CCT GAA CAC CGA CAG TGG ATA TAT ATG AAC CGC ACT GCC ACT GAT 
1072 

AAG TGG GAG GAT TAT CCC AAG TCT GAG AAT GAA AGT AAC ATC AGA TAT GTA 
1123 

AGT GAA CTT CAC TTG ACC AGA TTA AAA GGG ACC GAA GGA GGC ACT TAC ACA 
1174 

TTT CTC GTG TCC AAT GCT GAT GTC AAT TCT TCT GTG ACA TTT AAT GTT TAC 
1225 

GTG AAC ACA AAA CCA GAA ATC CTG ACT CAT GAC AGG CTC ATG AAC GGC ATG 

1276 

CTC CAG TGT GTG GCG GCA GGC TTC CCA GAG CCC ACC ATC GAT TGG TAT TTC 
1327 

TGT CCA GGC ACC GAG CAG AGA TGT TCC GTT CCC GTT GGG CCA GTG GAC GTG 

1378 

CAG ATC CAA AAC TCA TCT GTA TCA CCG TTT GGA AAA CTA GTG ATT CAC AGC 
1429

TCC ATT GAT TAC AGT GCA TTC AAA CAC AAC GGC ACG GTG GAG TGC AGG GCT
1480 

TAC AAC GAT GTG GGC AAG AGT TCT GCC TTT TTT AAC TTT GCA TTT AAA GAA 
1531 

CAA ATC CAT GCC CAC ACC CTC TTC ACG CCT TTG CTG ATT GGT TTT GTG ATC 
1582 

GCA GCG GGT ATG ATG TGT ATC ATC GTG ATG ATT CTC ACC TAT AAA TAT CTA 
1633 

CAG AAG CCC ATG TAT GAA GTA CAG TGG AAG GTT GTC GAG GAG ATA AAT GGA 
1684 

AAC AAT TAT GTC TAC ATA GAC CCA ACG CAA CTT CCT TAT GAT CAC AAA TGG 
1735 

GAA TTT CCC AGG AAC AGG CTG AGT TTT GGC AAA ACC TTG GGT GCT GGC GCC 
1786 

TTC GGG AAA GTC GTT GAG GCC ACT GCA TAC GGC TTA ATT AAG TCA GAT GCG 
1837 

GCC ATG ACC GTT GCC GTG AAG ATG CTC AAA CCA AGT GCC CAT TTA ACG GAA 
1888 

CGA GAA GCC CTA ATG TCT GAA CTC AAA GTC TTA AGT TAC CTC GGT AAT CAC 
1939 

ATG AAT ATT GTG AAT CTT CTC GGC GCC TGC ACC ATT GGA GGG CCC ACC CTG 
1990 

GTC ATT ACA GAA TAT TGT TGC TAT GGT GAT CTC CTG AAT TTT TTG AGA CGG 
2041 

AAA CGT GAT TCG TTT ATT TGC TCA AAG CAG GAA GAT CAC GCA GAA GCG GCG 
2092 

CTT TAT AAG AAC CTT CTG CAT TCA AAG GAG TCT TCC TGC AGT GAC AGT ACT 
2143 

AAC GAG TAC ATG GAC ATG AAA CCC GGA GTG TCT TAT GTG GTA CCA ACC AAG 
2194 

GCA GAC AAA AGG AGA-^CT GCG AGA ATA GGC TCA TAC ATA GAA CGA GAT GTG 
2245 

ACT CCT GCC ATC ATG GAA GAT GAT GAG TTG GCC CTA GAC CTG GAG GAC TTG 
2296 

CTC AGC TTT TCT TAC CAA GTG GCA AAG GGC ATG GCC TTC CTC GCC TCG AAG 
2347 

AAT TGT ATT CAC AGA GAC TTG GCG GCC AGA AAT ATC CTC CTT ACT CAT GGT 
2398 

CGA ATC ACA AAG ATT TGT GAT TTT GGT CTA GCC AGA GAC ATC AAG AAT GAT 
2449 

TCT AAT TAC GTG GTC AAA GGA AAC GCT CGG CTA CCC GTG AAG TGG ATG GCA 

2500 

CCT GAG AGC ATT TTC AAC TGT GTC TAC ACA TTT GAA AGC GAT GTC TGG TCC 
2551 

TAT GGG ATT TTT CTG TGG GAG CTC TTC TCT TTA GGG AGC AGC CCC TAC CCC 
2602 

GGA ATG CCA GTT GAT TCT AAA TTC TAC AAG ATG ATC AAG GAG GGT TTC CGA 
2653 

ATG CTC AGC CCT GAG CAT GCA CCT GCG GAA ATG TAT GAC ATC ATG AAG ACT 
2704 

TGC TGG GAT GCG GAT CCC CTC AAA AGA CCA ACG TTT AAG CAG ATC GTG CAG 

2755 

CTG ATT GAG AAG CAG ATT TCG GAG AGC ACC AAT CAC ATT TAT TCC AAC TTA 
2806 

GCG AAC TGC AGC CCC CAC CGG GAG AAC CCC GCG GTG GAT CAT TCT GTG CGG 
2857 

ATC AAC TCC GTG GGC AGC AGT GCC TCC TCC ACG CAG CCT CTG CTT GTC CAC 
2908 

GAA GAT GTC TGA 



FIG. 10 CONT'D

1012

Wild Boar CTGCAGTGCT CCTGGTGAGG GGGGACGGGC

Meishan :

Large Black :

Hampshire 

Pietrain 

Duroc 

1062 

Wild Boar GCTGGAGCCA GGCTGCGGGG CTGAGGGCAG TGGTGCCGTC CTGCGGCCCG 

Meishan 

Large Black 

Hampshire 

Pietrain 

Duroc 

1112 

Wild Boar GTTCCTACGT GGCTGGGCAG CCCCTTGGCA GAGAGGACGG GCCGGACATC 

Meishan 

Large Black 

Hampshire 

Pietrain 

Duroc 

1162 

Wild Boar TCTGAAGGTA TGGACGCTGG ACCCTCTGGG GCCCGACAGA GGAAGAGCCA 

Meishan G

Large Black G

Hampshire G

Pietrain G

Duroc G

1212 

Wild Boar GCACTTCCAG GAGGCATGGG GAGTGGGGGA GGCTGGAGAG ACGGCGGGGA 

Meishan 

Large Black 

Hampshire 

Pietrain 

Duroc 

1262 

Wild Boar GCGCCACCTC CATCCAGAGA CCACCACGCC CGCCTTTGGG GCGCGCTCTG 

Meishan 

Large Black 

Hampshire 

Pietrain 

Duroc 



FIG. 12

1312

Wild Boar GGGACTTTGC CCCCCACTGG GGTGGGACGT GTGCGGGCAG AAGCTGTCCG

Meishan

Large Black

Hampshire

Pietrain

Duroc

1362

Wild Boar GGTGTTGCTC ACTGCAGGAC CTCAGGGGAA GGCCTTCGTG ACTGCTAGGA

Meishan

Large Black

Hampshire

Pietrain

Duroc

1412

Wild Boar AGCAGGCGCA GCGCCCCGGC GGAGGGCGGG GCCCCTCTCT TCTACGGCTC

Meishan

Large Black

Hampshire

Pietrain

Duroc

Wild Boar AGTG

Meishan

Large Black

Hampshire

Duroc


SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: 

(A) NAME: PIG IMPROVEMENT COMPANY UK LIMITED 

(B) STREET: 100 GEORGE STREET 

(C) CITY: LONDON 

(D) STATE: LONDON 

(E) COUNTRY: ENGLAND 

(F) POSTAL CODE (ZIP): W1H 5RH

(A) NAME: ANDERSSON, LEIF 

(B) STREET: MELICA HB 

(C) CITY: BERGAGATAN 30

(D) STATE: UPPSALA 

(E) COUNTRY: SWEDEN 

(F) POSTAL CODE (ZIP): S-752 39

(ii) TITLE OF INVENTION: METHODS FOR ANALYSING ANIMAL PRODUCTS 
(iii) NUMBER OF SEQUENCES: 50 

(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: PatentIn Release #1.0, Version #1.30 (EPO)

(v) CURRENT APPLICATION DATA: 

APPLICATION NUMBER: WO PCT/GB98/ 01531 

(2) INFORMATION FOR SEQ ID NO : 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "PRIMER" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1: 
TGTAAAACGA CGGCCAGTRG TGCCTGGAGG TGT 
(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 



(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "PRIMER MSHR REVERSE 5" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 
CGCCCAGATG GCCGCGATGG ACCG 24 
(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "Primer" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3: 
CGGCCATCTG GGCGGGCAGC GTGC 24 
(2) INFORMATION FOR SEQ ID NO : 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "Primer" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 
GGAAGGCGTA GATGAGGGGG TCCA 24 
(2) INFORMATION FOR SEQ ID NO: 5:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "Primer" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5:
GCACATCGCC CGGCTCCACA AGAC 24
(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "PRIMER MSHR REVERSE"



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
GGGGCAGAGG ACGACGAGGG AGAG 24
(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other nucleic acid 

(A) DESCRIPTION: /desc = "PRIMER LA93"



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7:
GAGCAGCCCC TACCCCGGAA TGCCAGTTGA 30
(2) INFORMATION FOR SEQ ID NO: 8:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "PRIMER KIT56"



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8:
CTTTAAAACA GAACATAAAA GCGGAAACAT CATGCGAAGG 40
(2) INFORMATION FOR SEQ ID NO: 9:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other nucleic acid 

(A) DESCRIPTION: /desc = "Primer"



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9:
CGCCCAGATG GCCGCGATGG ACCG 24
(2) INFORMATION FOR SEQ ID NO: 10:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other nucleic acid 

(A) DESCRIPTION: /desc = "Primer"



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10:
TGCGCTACCA CAGCATCGTG ACCCTGC 27
(2) INFORMATION FOR SEQ ID NO: 11:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "Primer"



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11:
GTAGTAGGCG ATGAAGAGCG TGCT 24



(2) INFORMATION FOR SEQ ID NO: 12: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "Primer" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
CTGCCTGGCC GTGTCGGACC TG 22
(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "Primer" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
CTGTGGTAGC GCAGCGCGTA GAAG 24
(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other nucleic acid 

(A) DESCRIPTION: /desc = "Primer" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
TGAGGTAGGA GAGTTTTGGG 20
(2) INFORMATION FOR SEQ ID NO: 15:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "Primer"



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15:
TCGAAATTGA GGGGAAGACC 20
(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "Primer"



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16:
GTATTCACAG AGACTTGGCG GC 22
(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "Primer"



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17:
AAACCTGCAA GGAAAATCCT TCACGG 26
(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid 



(A) DESCRIPTION: /desc = "Primer" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
GAATATTGTT GCTATGGTGA TCTCC 25
(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "Primer" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
CCGCTTCTGC GTGATCTTCC TG 22
(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other nucleic acid 

(A) DESCRIPTION: /desc = "Primer"



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20:
CTGGATGTCC TGTGTTCCCT GT 22
(2) INFORMATION FOR SEQ ID NO: 21:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "Primer"



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21:
AGGTTTGTCT GCAGCAGAAG CTC 23
(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "Primer"



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 
GAAAGTGAYG TCTGGTCCTA TSGGAT 26
(2) INFORMATION FOR SEQ ID NO: 23:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other nucleic acid 

(A) DESCRIPTION: /desc = "Primer" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 
AGCCTTCCTT GATCATCTTG TAG 23
(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "Primer" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 
TGTGGGAGCT CTTCTCTTTA GG 22



(2) INFORMATION FOR SEQ ID NO: 25: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Other nucleic acid 

(A) DESCRIPTION: /desc = "Primer"



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25:
CCAGCAGGAC AATGGGAACA TCT 23
(2) INFORMATION FOR SEQ ID NO: 26:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other nucleic acid 

(A) DESCRIPTION: /desc = "Primer"



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26:
GGCTCTGGGG GCTCGGCTTT GC 22
(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "Primer"



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27:
TCAGACATCT TCGTGGACAA GCAGAGG 27
(2) INFORMATION FOR SEQ ID NO: 28:

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 25 base pairs 



(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "Primer"



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28:
TCRTACATAG AAAGAGAYGT GACTC 25
(2) INFORMATION FOR SEQ ID NO: 29:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "Primer"



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29:
AGCCTTCCTT GATCATCTTG TAG 23
(2) INFORMATION FOR SEQ ID NO: 30:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other nucleic acid 

(A) DESCRIPTION: /desc = "Primer"



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30:
GAGCAGCCCC TACCCCGGAA TGCCAGTTGA 30
(2) INFORMATION FOR SEQ ID NO: 31:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Other nucleic acid 

(A) DESCRIPTION: /desc = "Primer"



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31:
CTTTAAAACA GAACATAAAA GCGGAAACAT CATGCGAAGG 40
(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "Primer"



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32:
GGTCTAGATC ACCAGGAGCA CTGCAGCACC 30
(2) INFORMATION FOR SEQ ID NO: 33:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "Primer"



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33:
GGGAAGCTTG ACCCCCGAGA GCGACGCGCC 30
(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "Primer"



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 
GGACTCCATG GAGCCGCAGA TGAGCACGGT 30

(2) INFORMATION FOR SEQ ID NO: 35:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other nucleic acid 

(A) DESCRIPTION: /desc = "Primer" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 
GCAAGACCCT CCAGGAGGTG 20

(2) INFORMATION FOR SEQ ID NO: 36:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "Primer" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36:
CACTGAGCCG TAGAAGAGAG 20

(2) INFORMATION FOR SEQ ID NO: 37:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 863 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 1..863

(D) OTHER INFORMATION: /note= "Fig 1 - Wildboar"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37:

CTCCCTGCTC CCTGCTCCCT GGCGGGACGA TGCCTGTGCT TGGCCCGGAG AGGAGGCTGC 60 

TGGCTTCCCT CAGCTCCGCG CCCCCAGCCG CCCCCCGGCC GCCAACGCCT CGGGCTCAGA 120

CCAACCAGAC GGGCCCCCAG TGCCTGGAGG TGTCCATTCC CGACGGGCTC TTCCTCAGCC 180 

TGGGGCTGGT GAGCCTCGTG GAGAACGTGC TGGTGGTGGC CGCCATCGCC AAGAACCGCA 240 

ACCTGCACTC GCCCATGTAC TACTTCGTCT GCTGCCTGGC CGTGTCGGAC CTGCTGGTGA 300

GCGTGAGCAA CGTGCTGGAG ACGGCCGTGC TGCTGCTGCT GGAGGCGGGC GCCCTGGCCG 360

CCCAGGCCGC CGTGGTGCAG CAGCTGGACA ATGTCATGGA CGTGCTCATC TGCGGCTCCA 420

TGGTGTCCAG CCTCTGCTTC CTGGGCGCCA TCGCCGTGGA CCGCTACGTG TCCATCTTCT 480 

ACGCGCTGCG CTACCACAGC ATCGTGACGC TGCCCCGCGC GGGGCGGGCT ATCGCGGCGA 540 

TCTGGGCGGG CAGCGTGCTC TCCGTCAGCT TCTTCGTGGC CATGCTGGCG CTCATGGCGG 600 

TACTGTACGT CCACATGCTG GCCCGGGCCT GCCAGCACGG CCGGCACATC GCCCGGCTCC 660 

ACAAGACGCA GCACCCCACC CGCCAGGGCT GCGGCCTCAA GGGCGCGGCC ACCCTCACCA 720

TCCTGCTGGG CGTCTTCCTC CTCTGCTGGG CACCCTTCTT CCTGCACCTC TCCCTCGTCG 780

TCCTCTGCCC CCAGCACCCC ACCTGCGGCT GCGTCTTCAA GAACGTCAAC CTCTTTCTGG 840 

CCCTCGTCAT CTGCAACTCC ATC 863 
(2) INFORMATION FOR SEQ ID NO: 38:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 863 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION: 1..705

(D) OTHER INFORMATION: /note= "Fig 1 - Meishan"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38:

CTCCCTGCTC CCTGCTCCCT GGCGGGACGA TGCCTGTGCT TGGCCCGGAG AGGAGGCTGC 60 

TGGCTTCCCT CAGCTCCGCG CCCCCAGCCG CCCCCCGGCC GCCAACGCCT CGGGCTCAGA 120

CCAACCAGAC GGGCCCCCAG TGCCTGGAGG TGTCCATTCC CGACGGGCTC TTCCTCAGCC 180 



TGGGGCTGGT GAGCCTCGTG GAGAACGTGC TGGTGGTGGC CGCCATCGCC AAGAACCGCA 240 

ACCTGCACTC GCCCATGTAC TACTTCGTCT GCTGCCTGGC CGTGTCGGAC CTGCTGGTGA 300

GCGTGAGCAA CATGCTGGAG ACGGCCGTGC TGCCGCTGCT GGAGGCGGGC GCCCTGGCCG 360

CCCAGGCCGC CGTGGTGCAG CAGCTGGACA ACGTCATGGA CGTGCTCATC TGCGGCTCCA 420

TGGTGTCCAG CCTCTGCTTC CTGGGCGCCA TCGCCGTGGA CCGCTACGTG TCCATCTTCT 480 

ACGCGCTGCG CTACCACAGC ATCGTGACGC TGCCCCGCGC GGGGCGGGCT ATCGCGGCGA 540 

TCTGGGCGGG CAGCGTGCTC TCCGTCAGCT TCTTCGTGGC CATGCTGGCG CTCATGGCGG 600 

TACTGTACGT CCACATGCTG GCCCGGGCCT GCCAGCACGG CCGGCACATC GCCCGGCTCC 660 

ACAAGACGCA GCACCCCACC CGCCAGGGCT GCGGCCTCAA GGGCGCAGCC ACCCTCACCA 720

TCCTGCTGGG CGTCTTCCTC CTCTGCTGGG CACCCTTCTT CCTGCACCTC TCCCTCGTCG 780 

TCCTCTGCCC CCAGCACCCC ACCTGCGGCT GCGTCTTCAA GAACGTCAAC CTCTTTCTGG 840 

CCCTCGTCAT CTGCAACTCC ATC 863
(2) INFORMATION FOR SEQ ID NO: 39:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 865 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION: 1..705

(D) OTHER INFORMATION: /note= "Fig 1 - Pietrain"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39:
CTCCCTGCTC CCTGCTCCCT GGCGGGACGA TGCCTGTGCT TGGCCCGGAG AGGAGGCTGC 60 

TGGCTTCCCT CAGCTCCGCG CCCCCAGCCG CCCCCCCCGG CCGCCAACGC CTCGGGCTCA 120

GACCAACCAG ACGGGCCCCC AGTGCCTGGA GGTGTCCATT CCCGACGGGC TCTTCCTCAG 180 

CCTGGGGCTG GTGAGCCTCG TGGAGAACGT GCTGGTGGTG GCCGCCATCG CCAAGAACCG 240 

CAACCTGCAC TCGCCCATGT ACTACTTCGT CTGCTGCCTG GCCGTGTCGG ACCTGCTGGT 300

GAGCGTGAGC AACGTGCTGG AGACGGCCGT GCTGCTGCTG CTGGAGGCGG GCGCCCTGGC 360

CGCCCAGGCC GCCGTGGTGC AGCAGCTGGA CAATGTCATG AACGTGCTCA TCTGCGGCTC 420

CATGGTGTCC AGCCTCTGCT TCCTGGGCGC CATCGCCGTG GACCGCTACG TGTCCATCTT 480

CTACGCGCTG CGCTACCACA GCATCGTGAC GCTGCCCCGC GCGGGGCGGG CTATCGCGGC 540 

GATCTGGGCG GGCAGCGTGC TCTCCGTCAG CTTCTTCGTG GCCATGCTGG CGCTCATGGC 600 

GGTACTGTAC GTCCACATGC TGGCCCGGGC CTGCCAGCAC GGCCGGCACA TCGCCCGGCT 660 

CCACAAGACG CAGCACCCCA CCCGCCAGGG CTGCGGCCTC AAGGGCGCGG CCACCCTCAC 720

CATCCTGCTG GGCGTCTTCC TCCTCTGCTG GGCACCCTTC TTCCTGCACC TCTCCCTCGT 780

CGTCCTCTGC CCCCAGCACC CCACCTGCGG CTGCGTCTTC AAGAACGTCA ACCTCTTTCT 840

GGCCCTCGTC ATCTGCAACT CCATC 865 
(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 705 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION: 1..705

(D) OTHER INFORMATION: /note= "Fig 1 - Largewhite"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40:

CCCGACGGGC TCTTCCTCAG CCTGGGGCTG GTGAGCCTCG TGGAGAACGT GCTGGTGGTG 60 

GCCGCCATCG CCAAGAACCG CAACCTGCAC TCGCCCATGT ACTACTTCGT CTGCTGCCTG 120

GCCGTGTCGG ACCTGCTGGT GAGCGTGAGC AACGTGCTGG AGACGGCCGT GCTGCTGCTG 180

CTGGAGGCGG GCGCCCTGGC CGCCCAGGCC GCCGTGGTGC AGCAGCTGGA CAATGTCATG 240

AACGTGCTCA TCTGCGGCTC CATGGTGTCC AGCCTCTGCT TCCTGGGCGC CATCGCCGTG 300 

GACCGCTACG TGTCCATCTT CTACGCGCTG CGCTACCACA GCATCGTGAC GCTGCCCCGC 360

GCGGGGCGGG CTATCGCGGC GATCTGGGCG GGCAGCGTGC TCTCCGTCAG CTTCTTCGTG 420

GCCATGCTGG CGCTCATGGC GGTACTGTAC GTCCACATGC TGGCCCGGGC CTGCCAGCAC 480

GGCCGGCACA TCGCCCGGCT CCACAAGACG CAGCACCCCA CCCGCCAGGG CTGCGGCCTC 540

AAGGGCGCGG CCACCCTCAC CATCCTGCTG GGCGTCTTCC TCCTCTGCTG GGCACCCTTC 600

TTCCTGCACC TCTCCCTCGT CGTCCTCTGC CCCCAGCACC CCACCTGCGG CTGCGTCTTC 660 



AAGAACGTCA ACCTCTTTCT GGCCCTCGTC ATCTGCAACT CCATC 705



(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 708 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION: 1..708

(D) OTHER INFORMATION: /note= "Fig 1 - Hampshire"



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 

ATTCCCGACG GGCTCTTCCT CAGCCTGGGG CTGGTGAGCC TCGTGGAGAA CGTGCTGGTG 60 

GTGGCCGCCA TCGCCAAGAA CCGCAACCTG CACTCGCCCA TGTACTACTT CGTCTGCTGC 120

CTGGCCGTGT CGGACCTGCT GGTGAGCGTG AGCAACGTGC TGGAGACGGC CGTGCTGCTG 180

CTGCTGGAGG CGGGCGCCCT GGCCGCCCAG GCCGCCGTGG TGCAGCAGCT GGACAATGTC 240

ATGAACGTGC TCATCTGCGG CTCCATGGTG TCCAGCCTCT GCTTCCTGGG CGCCATCGCC 300

GTGGACCGCT ACGTGTCCAT CTTCTACGCG CTGCGCTACC ACAGCATCGT GACGCTGCCC 360

CGCGCGGGGC GGGCTATCGC GGCGATCTGG GCGGGCAGCG TGCTCTCCGT CAGCTTCTTC 420

GTGGCCATGC TGGCGCTCAT GGCGGTACTG TACGTCCACA TGCTGGCCCG GGCCTGCCAG 480

CACGGCCGGC ACATCGCCCG GCTCCACAAG ACGCAGCACC CCACCCGCCA GGGCTGCGGC 540

CTCAAGGGCG CGGCCACCCT CACCATCCTG CTGGGCGTCT TCCTCCTCTG CTGGGCACCC 600

TTCTTCCTGC ACCTCTCCCT CGTCGTCCTC TGCCCCCAGC ACCCCACCTG CGGCTGCGTC 660

TTCAAGAACG TCAACCTCTT TCTGGCCCTC GTCATCTGCA ACTCCATC 708
(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 708 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION: 1..708

(D) OTHER INFORMATION: /note= "Fig 1 - Duroc"



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 

ATTCCCGACG GGCTCTTCCT CAGCCTGGGG CTGGTGAGCC TCGTGGAGAA CGTGCTGGTG 60 

GTGGCCGCCA TCGCCAAGAA CCGCAACCTG CACTCGCCCA TGTACTACTT CGTCTGCTGC 120

CTGGCCGTGT CGGACCTGCT GGTGAGCGTG AGCAACGTGC TGGAGACGGC CGTGCTGGTG 180 

CTGCTGGAGG CGGGCGCCCT GGCCGCCCAG GCCGCCGTGG TGCAGCAGCT GGACAATGTC 240 

ATGGACGTGC TCATCTGCGG CTCCATGGTG TCCAGCCTCT GCTTCCTGGG CGCCATCGCC 300

GTGGACCGCT ACGTGTCCAT CTTCTACGCG CTGCGCTACC ACAGCATCGT GACGCTGCCC 360

CGCGTGGGGC GGGCTATCGC GGCGATCTGG GCGGGCAGCG TGCTCTCCGT CAGCTTCTTC 420

GTGGCCATGC TGGCGCTCAT GGCGGTACTG TACGTCCACA TGCTGGCCCG GGCCTGCCAG 480

CACGGCCGGC ACATCGCCCG GCTCCACAAG ACGCAGCACC CCACCCGCCA GGGCTGCGGC 540 

CTCAAGGGCA CGGCCACCCT CACCATCCTG CTGGGCGTCT TCCTCCTCTG CTGGGCACCC 600 

TTCTTCCTGC ACCTCTCCCT CGTCGTCCTC TGCCCCCAGC ACCCCACCTG CGGCTGCGTC 660 

TTCAAGAACG TCAACCTCTT TCTGGCCCTC GTCATCTGCA ACTCCATC 708
(2) INFORMATION FOR SEQ ID NO: 43:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 253 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1..253

(D) OTHER INFORMATION: /note= "Fig 1 - Wild Boar"

(ix) FEATURE:

(A) NAME/KEY: Modified-site

(B) LOCATION: 1

(D) OTHER INFORMATION: /note= "X = unknown amino acid"



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 

Xaa Pro Asn Gly Leu Phe Leu Ser Leu Gly Leu Val Ser Leu Val Glu
1 5 10 15

Asn Val Leu Val Val Ala Ala Ile Ala Lys Asn Arg Asn Leu His Ser
20 25 30

Pro Met Tyr Tyr Phe Val Cys Cys Leu Ala Val Ser Asp Leu Leu Val
35 40 45

Ser Val Ser Asn Val Leu Glu Thr Ala Val Leu Leu Leu Leu Glu Ala
50 55 60

Gly Ala Leu Ala Ala Gln Ala Ala Val Val Gln Gln Leu Asp Asn Val
65 70 75 80

Met Asp Val Leu Ile Cys Gly Ser Met Val Ser Ser Leu Cys Phe Leu
85 90 95

Gly Ala Ile Ala Val Asp Arg Tyr Val Ser Ile Phe Tyr Ala Leu Arg
100 105 110

Tyr His Ser Ile Val Thr Leu Pro Arg Ala Gly Arg Ala Ile Ala Ala
115 120 125

Ile Trp Ala Gly Ser Val Leu Ser Ser Thr Leu Phe Ile Ala Tyr Tyr
130 135 140

His His Thr Ala Val Leu Leu Gly Leu Val Ser Phe Phe Val Ala Met
145 150 155 160

Leu Ala Leu Met Ala Val Leu Tyr Val His Met Leu Ala Arg Ala Cys
165 170 175

Gln His Gly Arg His Ile Ala Arg Leu His Lys Thr Gln His Pro Thr
180 185 190

Arg Gln Gly Cys Gly Leu Lys Gly Ala Ala Thr Leu Thr Ile Leu Leu
195 200 205

Gly Val Phe Leu Leu Cys Trp Ala Pro Phe Phe Leu His Leu Ser Leu
210 215 220

Val Val Leu Cys Pro Gln His Pro Thr Cys Gly Cys Val Phe Lys Asn
225 230 235 240

Val Asn Leu Phe Leu Ala Leu Val Ile Cys Asn Ser Ile
245 250

(2) INFORMATION FOR SEQ ID NO: 44:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 253 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1..253

(D) OTHER INFORMATION: /note= "Fig 1 - Meishan"

(ix) FEATURE:

(A) NAME/KEY: Modified-site

(B) LOCATION: 1

(D) OTHER INFORMATION: /note= "X = Unknown amino acid"



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 

Xaa Pro Asn Gly Leu Phe Leu Ser Leu Gly Leu Val Ser Leu Val Glu
1 5 10 15

Asn Val Leu Val Val Ala Ala Ile Ala Lys Asn Arg Asn Leu His Ser
20 25 30

Pro Met Tyr Tyr Phe Val Cys Cys Leu Ala Val Ser Asp Leu Leu Val
35 40 45

Ser Val Ser Asn Met Leu Glu Thr Ala Val Leu Pro Leu Leu Glu Ala
50 55 60

Gly Ala Leu Ala Ala Gln Ala Ala Val Val Gln Gln Leu Asp Asn Val
65 70 75 80

Met Asp Val Leu Ile Cys Gly Ser Met Val Ser Ser Leu Cys Phe Leu
85 90 95

Gly Ala Ile Ala Val Asp Arg Tyr Val Ser Ile Phe Tyr Ala Leu Arg
100 105 110

Tyr His Ser Ile Val Thr Leu Pro Arg Ala Gly Arg Ala Ile Ala Ala
115 120 125

Ile Trp Ala Gly Ser Val Leu Ser Ser Thr Leu Phe Ile Ala Tyr Tyr
130 135 140

His His Thr Ala Val Leu Leu Gly Leu Val Ser Phe Phe Val Ala Met
145 150 155 160

Leu Ala Leu Met Ala Val Leu Tyr Val His Met Leu Ala Arg Ala Cys
165 170 175

Gln His Gly Arg His Ile Ala Arg Leu His Lys Thr Gln His Pro Thr
180 185 190

Arg Gln Gly Cys Gly Leu Lys Gly Ala Ala Thr Leu Thr Ile Leu Leu
195 200 205

Gly Val Phe Leu Leu Cys Trp Ala Pro Phe Phe Leu His Leu Ser Leu
210 215 220

Val Val Leu Cys Pro Gln His Pro Thr Cys Gly Cys Val Phe Lys Asn
225 230 235 240

Val Asn Leu Phe Leu Ala Leu Val Ile Cys Asn Ser Ile
245 250

(2) INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 253 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1..253

(D) OTHER INFORMATION: /note= "Fig 1 - Largewhite and Hampshire"

(ix) FEATURE:

(A) NAME/KEY: Modified-site

(B) LOCATION: 1

(D) OTHER INFORMATION: /note= "X = unknown amino acid"



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 

Xaa Pro Asn Gly Leu Phe Leu Ser Leu Gly Leu Val Ser Leu Val Glu
1 5 10 15

Asn Val Leu Val Val Ala Ala Ile Ala Lys Asn Arg Asn Leu His Ser
20 25 30

Pro Met Tyr Tyr Phe Val Cys Cys Leu Ala Val Ser Asp Leu Leu Val
35 40 45

Ser Val Ser Asn Val Leu Glu Thr Ala Val Leu Leu Leu Leu Glu Ala
50 55 60

Gly Ala Leu Ala Ala Gln Ala Ala Val Val Gln Gln Leu Asp Asn Val
65 70 75 80

Met Asn Val Leu Ile Cys Gly Ser Met Val Ser Ser Leu Cys Phe Leu
85 90 95

Gly Ala Ile Ala Val Asp Arg Tyr Val Ser Ile Phe Tyr Ala Leu Arg
100 105 110

Tyr His Ser Ile Val Thr Leu Pro Arg Ala Gly Arg Ala Ile Ala Ala
115 120 125

Ile Trp Ala Gly Ser Val Leu Ser Ser Thr Leu Phe Ile Ala Tyr Tyr
130 135 140

His His Thr Ala Val Leu Leu Gly Leu Val Ser Phe Phe Val Ala Met
145 150 155 160

Leu Ala Leu Met Ala Val Leu Tyr Val His Met Leu Ala Arg Ala Cys
165 170 175

Gln His Gly Arg His Ile Ala Arg Leu His Lys Thr Gln His Pro Thr
180 185 190

Arg Gln Gly Cys Gly Leu Lys Gly Ala Ala Thr Leu Thr Ile Leu Leu
195 200 205

Gly Val Phe Leu Leu Cys Trp Ala Pro Phe Phe Leu His Leu Ser Leu
210 215 220

Val Val Leu Cys Pro Gln His Pro Thr Cys Gly Cys Val Phe Lys Asn
225 230 235 240

Val Asn Leu Phe Leu Ala Leu Val Ile Cys Asn Ser Ile
245 250



(2) INFORMATION FOR SEQ ID NO: 46: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 253 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1..253

(D) OTHER INFORMATION: /note= "Fig 1 - Duroc"

(ix) FEATURE:

(A) NAME/KEY: Modified-site

(B) LOCATION: 1

(D) OTHER INFORMATION: /note= "X = unknown amino acid"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46:

Xaa Pro Asn Gly Leu Phe Leu Ser Leu Gly Leu Val Ser Leu Val Glu
1 5 10 15

Asn Val Leu Val Val Ala Ala Ile Ala Lys Asn Arg Asn Leu His Ser
20 25 30

Pro Met Tyr Tyr Phe Val Cys Cys Leu Ala Val Ser Asp Leu Leu Val
35 40 45

Ser Val Ser Asn Val Leu Glu Thr Ala Val Leu Leu Leu Leu Glu Ala
50 55 60

Gly Ala Leu Ala Ala Gln Ala Ala Val Val Gln Gln Leu Asp Asn Val
65 70 75 80

Met Asp Val Leu Ile Cys Gly Ser Met Val Ser Ser Leu Cys Phe Leu
85 90 95

Gly Ala Ile Ala Val Asp Arg Tyr Val Ser Ile Phe Tyr Ala Leu Arg
100 105 110

Tyr His Ser Ile Val Thr Leu Pro Arg Val Gly Arg Ala Ile Ala Ala
115 120 125

Ile Trp Ala Gly Ser Val Leu Ser Ser Thr Leu Phe Ile Ala Tyr Tyr
130 135 140

His His Thr Ala Val Leu Leu Gly Leu Val Ser Phe Phe Val Ala Met
145 150 155 160

Leu Ala Leu Met Ala Val Leu Tyr Val His Met Leu Ala Arg Ala Cys
165 170 175

Gln His Gly Arg His Ile Ala Arg Leu His Lys Thr Gln His Pro Thr
180 185 190

Arg Gln Gly Cys Gly Leu Lys Gly Thr Ala Thr Leu Thr Ile Leu Leu
195 200 205

Gly Val Phe Leu Leu Cys Trp Ala Pro Phe Phe Leu His Leu Ser Leu
210 215 220

Val Val Leu Cys Pro Gln His Pro Thr Cys Gly Cys Val Phe Lys Asn
225 230 235 240

Val Asn Leu Phe Leu Ala Leu Val Ile Cys Asn Ser Ile
245 250

(2) INFORMATION FOR SEQ ID NO: 47:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2919 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 

ATGAGAGGCG CTCGCCGCGC CTGGGATTTT CTCTTCGTCC TGCAGCTCTT GCTTCGCGTC 60 

CAGACAGGCT CTTCTCAGCC ATCTGTGAGT CCAGAGGAAC TGTCTCCACC ATCCATCCAT 120

CCAGCAAAAT CAGAGTTAAT CGTCAGTGCT GGCGATGAGA TTAGGCTGTT CTGCACCGAT 180 



CCAGGATCTG TCAAATGGAC TTTTGAGACC CTGGGTCAGC TGAGTGAGAA TACACACGCA 240 

GAGTGGATCG TGGAGAAAGC AGAGGCCATG AATACAGGCA ATTATACATG CACCAATGAA 300

GGCGGTTTAA GCAGTTCCAT TTATGTGTTT GTTAGAGATC CTGAGAAGCT TTTCCTCGTC 360 

GACCCTCCCT TGTATGGGAA GGAGGACAAT GACGCGCTGG TCCGATGTCC TCTGACGGAC 420 

CCAGAGGTGA CCAATTACTC CCTCACGGGC TGCGAGGGGA AACCCCTTCC CAAGGATTTG 480

ACCTTCGTCG CGGACCCCAA GGCCGGCATC ACCATCAGAA ACGTGAAGCG CGAGTATCAT 540

CGGCTCTGTC TCCACTGCTC CGCCAACCAG GGGGGCAAGT CCGTGCTGTC GAAGAAATTC 600 

ACCCTGAAAG TGAGGGCAGC CATCAGAGCT GTACCTGTTG TGGCTGTGTC CAAAGCAAGC 660 

TACCTTCTCA GGGAAGGGGA GGAATTTGCC GTGATGTGCT TGATCAAAGA CGTGTCTAGT 720

TCCGTGGACT CCATGTGGAT CAGGGAGAAC AGCCAGACTA AAGCACAGGT GAAGAGGAAT 780

AGCTGGCATC AGGGTGACTT CAATTTTCTG CGGCAGGAAA GGCTGACAAT CAGCTCAGCA 840 

AGAGTTAATG ATTCTGGCGT GTTCATGTGT TACGCCAATA ATACTTTTGG ATCTGCAAAT 900

GTCACAACCA CCTTAGAAGT AGTAGATAAA GGATTCATTA ATATCTTCCC TATGATGAAT 960

ACCACTGTGT TTGTAAACGA TGGAGAGGAT GTGGATCTAA TTGTTGAGTA CGAGGCGTAC 1020

CCCAAACCTG AACACCGACA GTGGATATAT ATGAACCGCA CTGCCACTGA TAAGTGGGAG 1080 

GATTATCCCA AGTCTGAGAA TGAAAGTAAC ATCAGATATG TAAGTGAACT TCACTTGACC 1140 

AGATTAAAAG GGACCGAAGG AGGCACTTAC ACATTTCTCG TGTCCAATGC TGATGTCAAT 1200 

TCTTCTGTGA CATTTAATGT TTACGTGAAC ACAAAACCAG AAATCCTGAC TCATGACAGG 1260 

CTCATGAACG GCATGCTCCA GTGTGTGGCG GCAGGCTTCC CAGAGCCCAC CATCGATTGG 1320

TATTTCTGTC CAGGCACCGA GCAGAGATGT TCCGTTCCCG TTGGGCCAGT GGACGTGCAG 1380

ATCCAAAACT CATCTGTATC ACCGTTTGGA AAACTAGTGA TTCACAGCTC CATTGATTAC 1440

AGTGCATTCA AACACAACGG CACGGTGGAG TGCAGGGCTT ACAACGATGT GGGCAAGAGT 1500

TCTGCCTTTT TTAACTTTGC ATTTAAAGAA CAAATCCATG CCCACACCCT CTTCACGCCT 1560 

TTGCTGATTG GTTTTGTGAT CGCAGCGGGT ATGATGTGTA TCATCGTGAT GATTCTCACC 1620

TATAAATATC TACAGAAGCC CATGTATGAA GTACAGTGGA AGGTTGTCGA GGAGATAAAT 1680 

GGAAACAATT ATGTCTACAT AGACCCAACG CAACTTCCTT ATGATCACAA ATGGGAATTT 1740 

CCCAGGAACA GGCTGAGTTT TGGCAAAACC TTGGGTGCTG GCGCCTTCGG GAAAGTCGTT 1800 

GAGGCCACTG CATACGGCTT AATTAAGTCA GATGCGGCCA TGACCGTTGC CGTGAAGATG 1860 

CTCAAACCAA GTGCCCATTT AACGGAACGA GAAGCCCTAA TGTCTGAACT CAAAGTCTTA 1920



AGTTACCTCG GTAATCACAT GAATATTGTG AATCTTCTCG GCGCCTGCAC CATTGGAGGG 1980 

CCCACCCTGG TCATTACAGA ATATTGTTGC TATGGTGATC TCCTGAATTT TTTGAGACGG 2040

AAACGTGATT CGTTTATTTG CTCAAAGCAG GAAGATCACG CAGAAGCGGC GCTTTATAAG 2100 

AACCTTCTGC ATTCAAAGGA GTCTTCCTGC AGTGACAGTA CTAACGAGTA CATGGACATG 2160 

AAACCCGGAG TGTCTTATGT GGTACCAACC AAGGCAGACA AAAGGAGATC TGCGAGAATA 2220

GGCTCATACA TAGAACGAGA TGTGACTCCT GCCATCATGG AAGATGATGA GTTGGCCCTA 2280

GACCTGGAGG ACTTGCTCAG CTTTTCTTAC CAAGTGGCAA AGGGCATGGC CTTCCTCGCC 2340

TCGAAGAATT GTATTCACAG AGACTTGGCG GCCAGAAATA TCCTCCTTAC TCATGGTCGA 2400 

ATCACAAAGA TTTGTGATTT TGGTCTAGCC AGAGACATCA AGAATGATTC TAATTACGTG 2460

GTCAAAGGAA ACGCTCGGCT ACCCGTGAAG TGGATGGCAC CTGAGAGCAT TTTCAACTGT 2520

GTCTACACAT TTGAAAGCGA TGTCTGGTCC TATGGGATTT TTCTGTGGGA GCTCTTCTCT 2580 

TTAGGGAGCA GCCCCTACCC CGGAATGCCA GTTGATTCTA AATTCTACAA GATGATCAAG 2640

GAGGGTTTCC GAATGCTCAG CCCTGAGCAT GCACCTGCGG AAATGTATGA CATCATGAAG 2700

ACTTGCTGGG ATGCGGATCC CCTCAAAAGA CCAACGTTTA AGCAGATCGT GCAGCTGATT 2760

GAGAAGCAGA TTTCGGAGAG CACCAATCAC ATTTATTCCA ACTTAGCGAA CTGCAGCCCC 2820

CACCGGGAGA ACCCCGCGGT GGATCATTCT GTGCGGATCA ACTCCGTGGG CAGCAGTGCC 2880

TCCTCCACGC AGCCTCTGCT TGTCCACGAA GATGTCTGA 2919
(2) INFORMATION FOR SEQ ID NO: 48: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 434 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48: 

CTGCAGTGCT CCTGGTGAGG GGGGACGGGC GCTGGAGCCA GGCTGCGGGG CTGAGGGCAG 60 

TGGTGCCGTC CTGCGGCCCG GTTCCTACGT GGCTGGGCAG CCCCTTGGCA GAGAGGACGG 120

GCCGGACATC TCTGAAGGTA TGGACGCTGG ACCCTCTGGG GCCCGACAGA GGAAGAGCCA 180 

GCACTTCCAG GAGGCATGGG GAGTGGGGGA GGCTGGAGAG ACGGCGGGGA GCGCCACCTC 240 



CATCCAGAGA CCACCACGCC CGCCTTTGGG GCGCGCTCTG GGGACTTTGC CCCCCACTGG 300

GGTGGGACGT GTGCGGGCAG AAGCTGTCCG GGTGTTGCTC ACTGCAGGAC CTCAGGGGAA 360

GGCCTTCGTG ACTGCTAGGA AGCAGGCGCA GCGCCCCGGC GGAGGGCGGG GCCCCTCTCT 420

TCTACGGCTC AGTG 434
(2) INFORMATION FOR SEQ ID NO: 49:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 433 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49:

CTGCAGTGCT CCTGGTGAGG GGGGCGGGCG CTGGAGCCAG GCTGCGGGGC TGAGGGCAGT 60

GGTGCCGTCC TGCGGCCCGG TTCCTACGTG GCTGGGCAGC CCCTTGGCAG AGAGGACGGG 120

CCGGACATCT CTGAAGGTAT GGACGCTGGA CCCTCTGGGG CCCGACAGAG GAAGAGCCGG 180

CACTTCCAGG AGGCATGGGG AGTGGGGGAG GCTGGAGAGA CGGCGGGGAG CGCCACCTCC 240

ATCCAGAGAC CACCACGCCC GCCTTTGGGG CGCGCTCTGG GGACTTTGCC CCCCACTGGG 300

GTGGGACGTG TGCGGGCAGA AGCTGTCCGG GTGTTGCTCA CTGCAGGACC TCAGGGGAAG 360

GCCTTCGTGA CTGCTAGGAA GCAGGCGCAG CGCCCCGGCG GAGGGCGGGG CCCCTCTCTT 420

CTACGGCTCA GTG 433 
(2) INFORMATION FOR SEQ ID NO: 50: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 434 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50:
CTGCAGTGCT CCTGGTGAGG GGGGACGGGC GCTGGAGCCA GGCTGCGGGG CTGAGGGCAG 60

TGGTGCCGTC CTGCGGCCCG GTTCCTACGT GGCTGGGCAG CCCCTTGGCA GAGAGGACGG 120

GCCGGACATC TCTGAAGGTA TGGACGCTGG ACCCTCTGGG GCCCGACAGA GGAAGAGCCG 180

GCACTTCCAG GAGGCATGGG GAGTGGGGGA GGCTGGAGAG ACGGCGGGGA GCGCCACCTC 240

CATCCAGAGA CCACCACGCC CGCCTTTGGG GCGCGCTCTG GGGACTTTGC CCCCCACTGG 300

GGTGGGACGT GTGCGGGCAG AAGCTGTCCG GGTGTTGCTC ACTGCAGGAC CTCAGGGGAA 360

GGCCTTCGTG ACTGCTAGGA AGCAGGCGCA GCGCCCCGGC GGAGGGCGGG GCCCCTCTCT 420

TCTACGGCTC AGTG 434



